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NOVEL HEDGEHOG-DERIVED POLYPEPTTOES 



BACKGROUND OF THE INVENTION 

1 . Field of the Inventim 

This invention relates generally to the field of protein processing and protein signalling 
pathways and specifically to two novel proteins having distinct activities, which are 
derived fix>m a conunon hedgehog protdn precursor. 

2. Description of the Related Art 

Embryologists have long performed experimental manipulations that reveal the striking 
abiUties of certain structures in vertebrate embryos to impose pattern upon surrounding 
tissues. Speculation on the mechanisms underlying these patterning effects usuaUy 
centers on the secretion of signaling molecule that eUcits an appropriate response from 
the tissues begin patterned. More recent work aimed at the identification of such 
signaling molecules implicates secreted proteins encoded by individual members of a 
small number of gene families. One such family of proteins which may have an 
influential effect upon patterning activities are those proteins encoded by the hedgehog 
genefiunily. 

The hedgehog (hh) gene was initially identified based on its requireraem for normal 
segmemal patterning in Drosophila (Nusslein-Volhard, C. & Wieschaus, E, Nature 
2S7:795-80l, 1980). Its fiinctions include local agnaling to coordinate the identities of 
adjacent cells within early embryonic segments (Hooper. J.E., & Scott, M P. Early 
Embryonic Development of Animals, pp. 1-48. 1992) and a later function in cuticle 
patterning that extends across many ceU diameters (Heemskerk. J. & DiNardo, S., Cell, 
7^:449.460. 1994). The hh gene also functions in the patterning of imaginal precursors 
of adult structures, including the appendages and the eye (Mohler, J. Genetics. 12Q: 1061- 



wo 96/17924 



PCTAIS95/1S463 



-2- 

1072, 1988;N4a, e/a/.. Cell, 7i:927.938. 1993; Heberlcin, etaL. Cell, 71:913-926, 1993; 
Tabata, T. & Kornbcrg. T.D., Cell, 76:89-102. 1992; Basler, K. & Stnihl, G., Nature, 
3^:208-214, 1994). Genetic and molecular evidence indicates that hedgehog proteins 
are secreted and function in extracellular signaling (Mohler, J., supra, Lee, et al, Cell, 
5 71:33-50, 1992; Taylor, et ai, Mech. Dev., 42:89-96, 1993). 

In vertebrates activities encoded by hh homologues have been implicated in ante- 
rior/posterior patterning of the limb (Riddle, et ai, Cell, 25:1401-1416, 1993; Chang, et 
ai. Development, 120:3339, 1994), and in dorsal/ventral patterning of the neural tube 
(Echelard, etal. Cell, 75:1417-1430. 1993; Krauss, eial. Cell, 21:1431-1444, 1993; 
10 Rodink, et al. Cell, 7g:761-775, 1994). 

The vertebrate ventral midbrain contains neurons whose degeneration or abnonnal 
function are linked to a number of diseases, including Parkinson's disease and schizophre- 
nia. It is known that motor nwrons develop in close proximity to the floor plate in the 
ventral midbrain. Midbrain projections to the striatum are involved in the control of 
voluntary movement (Bjorklund and Lindvall, In: Handbook of Chemical Neuroanatomy, 
eds., Borklund, et al.. Amsterdam: Elsevier. pp55-122, 1984) and loss of these neurons 
results in the motor disorders of Parkinson's disease (Hirsch, et al. Nature, 124 345, 
1988). Midbrain dopaminergic neurons that innervate limbic structures and the cortex 
influence emotional and cognitive behavior, respectively, and abnormal function of these 
neurons has been associated with schizophrenia and drug addiction (Seeman, et al., 
Nature. 26S A41. 1993). 

While the molecular nature of the. factors that specify neuronal cell fate have not been 
established, members of the transforming growth factor-P (TGF-P) (Lyons, et al.. Trends 
in Genetics, 2:408, 1991) or the hedgehog protein fiunily (Smith, J.C., Cell, 26: 193, 1994) 
25 may possess the characteristics expected from such factors as they participate in 
specification of cell fate, mediate inductive interactions between tissues, and in many 
cases aa at a distance of only a few cell diameters. 
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The present invention establishes that hh actrwties encoded by these genes play a crucial 
role in early patterning of the developing eye and in patterning of the brain. For the first 
time, the invention shows that internal cleavage of hedgehog protein product is critical 
for fiill fiincdon, and that the two novel products of this auto-proteolytic cleavage display 
distinguishable activities, thus demonstrating that hh signaling activity is a composite 
effect of two separate signaling proteins that derive from a common hh protein precursor. 
In so doing, the invention provides the means for specific patterning and proliferation of 
desired neuronal cell types for addressing disorders which arise from neuronal 
degeneration or abnormal function. 
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SUMMARY OF THE INVENTION 

The present invention is based on the seminal discovery that hedgehog proteins undergo 
auto-proteolytic cleavage v^ch resuhs in two separate proteins having distinct functional 
and structural characteristics. The two polypeptides, referred to as the "N" and "C" 
5 fragments of hedgehog, or N-terminal and C-terminal fragments, respectively, are 
produced after specific cleavage at a G'CF site recognized by the autoproteolytic domain 
in the native protein. 

Thus, in one embodiment, the invention provides a substantially pure polypeptide 
characterized by having an amino acid sequence derived from amino terminal amino 
10 acids of a hedgehog protein and having at its carboxy terminus, a Gl CF cleavage site 
specifically recognized by a proteolytic activity of the carboxy terminal fragment of the 
native hedgehog polypeptide. 

In another embodiment, the invention provides a substantially pure polypeptide 
characterized by having an amino acid sequence derived from carboxy terminal amino 
15 acids of a hedgehog protein and having at its amino terminus, a GICF cleavage site 
specifically recognized by a proteolytic activity of the carboxy terminal fragment of the 
native hedgehog polypeptide. 

The invention also provides a method for modulating proliferation or differentiation of 
neuronal cells, comprising contaaing the cells with a hedgehog polypeptide. The native 
20 hedgehog polypeptide, the N, or the C fi^igment, or functional fragments derived 
therefrom, are most usefiil for the induction of proliferation or differentiation of neuronal 
cells substantially derived from floor plate neuronal cells. 
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BRDEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows processing of the hh protein by immunoblots (A.C) with antibodies 
against amino (Abl) and caiboxy-tcrminal (Ab2) epitopes. HGURE IB and D are blots 
of samples immunoprecipitated with Abl (B, lanes 7-9), Ab2 (D. lanes 19-21), or pre- 
5 immune serum (B, lanes 10-12, and D, lanes 22-24). 

FIGURE IE shows a schematic illustration of the hedgehog cleavage mechanism. 

FIGURE 2 shows sequence similarity between hh proteins and serine proteases, hh 
protein sequences are aligned to residues 323 to 329 of the D. melanogaster protein and 
numbered as positions 1 to 7 (group A). The catalytic histidines of mammalian serine 
proteinases (group B) are aligned to the invariant histidine at portion 7 in hh proteins. 

FIGURE 3 shows autoproteolysis of the hh protdn. 3A shows a coomasie blue stained 
polyacrylamide gel showing production and purification of His«-U and -U329A 
protons from K colt. Samples were molecular weight markers (lanes 1 and 2); lysates of 
K coli cells carrying the His^-U expression construct without (lane 3) and with (lane 4) 
induction by IPTG; purified His^-U protein (lane 5); lysates of K coli cells that carry the 
HisrUm»A expression construct without (lane 6) and with (lane 7) induction by IPTG; 
purified HisrU„,»A protein (lane 8). FIGURE 3(B) is an immunoblot detected with Ab2 
showing transfected S2 cells induced to express hh (lane 1); His,-U and Hig-IA,,^ 
proteins incubated in cleavage reaction buffer for 0 hours (lanes 2 and 5), for 20 hours 
(lanes 3 and 6), and for 20 hours in the presence of 20 mM TAME (a serine protease 
inhibitor) (lanes 4 and 7). 

FIGURE 4 shows autoproteolytic fimctions of Drosophila (4A-C) and zebrafish (D) hh 
proteins map to the carboxy terminal fi-agments by in vitro translations of wild-type and 
mutant hh proteins. The locations of mutations and cleavage sites (arrows) in these 
25 proteins are illustrated schematically in 4E. 
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FIGURE 5 shows immunoblots showing heat shock induced expression of wild type and 
H329A mutant hh proteins in Drosophilia embryos (A) and (B) are immunoblots 
developed using Abl and Ab2 antibodies, respectively. Lanes 1 and 6, induced 
untransfected S2 cells; lanes 2 and 7, transfected S2 cells induced to express hh\ lanes 3 
5 and 8, heat shocked wild-type embryos; lanes 4 and 9, heat shocked hshh embryos; lanes 
S and 10, heat shocked hshh H329A embryos. 

FIGURE 6 shows in situ hybridization showing the embryonic effects of ubiquitously 
expressed wild type and H329A hh proteins. FIGURE 6 shows the embryonic 
distribution of wingless {wg) RNA as revealed by in situ hybridization is shown in (A) 

10 wild-type (homozygous y (B) hs/iA, and (C) hshh H329A embiyos that were 
exposed to two 10 minute heat shodcs separated by a 90-minute recovery period (33). 
>^dd-type embryos diowed little change in wg ^pression, whereas the wild-type protein 
and, to a lesser extent, the H329A protein each induced ectopic wg expression (Table 1). 
Panels (D), (E), and (F) show the dorsal surfiu:es of y hshh, and hshh H329A larvae, 

15 respectively, at the level of the fourth abdominal segment. These larvae were shocked for 
30 minutes as embryos and allowed to complete embryogenesis. Cuticle cell types (1 
2^ 3% and 4**) are labeled as described (J. Heemskerk and S. DiNardo, Cell 76, 449, 
1994). Note the expansion of 2° ceU types (naked cuticle) at the expense of 3 and some 
4*" types in the hsA/f embryo (E) under conditions where the phenotype of hshh H329A 

20 embryos (F) is identical to that of control embryos (D). 

5" i\' ■ 

FIGURE 7 shows X-gal staining to show imaginal disc effects of ubiquitous wild type 
and H329 hh proteins. X-gal staining was used to follow expression of wg (A-C) or dpp 
(D-L) in imaginal discs of late third-instar larvae that carry wg-lacZ or dpp-lacZ reporter 
genes. Leg (A-F), wing (G-I) and eye-antennal discs (J-L) from control larvae (A, D, G, 
25 J), larvae carrying the hshh transgene (B. E, H, K) and larvae carrying the hshh H329A 
transgene (C, F, I, L) are displayed. In all panels anterior is to the left. 
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FIGURE 8 (A) and (B) are fanmunoblots of cell pdlets (lane 1) or supernatants Oane 2) 
from transfccted S2 cell cultures expressing HH protein, developed with Abl (A) and 
Ab2 (B). Samples in each lane were from the same volume of resuspended total culture. 
Whereas N remained mostly associated with the cdl pellet (compare lanes 1 and 2 in A), 

5 C was nearly quantitatively released into the supernatant (compare lanes 1 and 2 in B). 
U displayed partitioning properties in between those of N and C (A and B). (Q 
demonstrates the heparin binding activity of various HH protein species generated by in 
vitro translations with microsomes (38). Samples were: total translation mix (lane 1); 
supernatant after incubation with heparin agarose or agarose (control) beads Oanes 2 and 

10 4); and material eluted from heparin j^arose or agarose beads after washing (lanes 3 and 
5). F, U, Nss and N fragments are depleted from reactions incubated vwth heparin agarose 
but not agarose beads (compare lanes 2 and 4 to 1), and the same species subsequently 
can be eluted from the heparin agarose but not the agarose beads (compare lanes 3 and 
S with lane 1). 

15 FIGURE 9 shows the differential localizations of N and C in embiyos by in situ 
localization of the hh transcript. Fig. 9 (A) is shown in comparison to the distribution of 
N and C epitopes detected with Abl and Ab2 m panels (B) and (C), respectively. Note 
that the distribution of N and C epitopes span approximately one-third and one-half of 
each segmental unit respectivdy, while the transoipt is limited to approximately 

20 one-quarter of each unit. In (D), the localization of C epitopes in embryos homozygous 
for the hh"' allele is detected with the use of Ab2. C epitopes in this mutant, which 
displays impaired auto-proteolytic activity (see text), are more restricted, and resemble 
the wild-type localization of N. Homozygous AA'" embryos were identified by loss of a 
marked balancer from a heterozygous parem stock. All embryos are at mid to late stoge 

25 9 (extended germ-band). 

FIGURE 10 shows a signal rday vwsus dual function models for hh protein action. In 
Fig. 10 (A), the long-range effects of hh signaling are achieved indirectiy through short- 
range induction of a second signaling molecule (X). Based on its biochemical properties 
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and its restricted tissue localization, N is presumed to represent the active short-range 
signal while the role of C would be limited to supplying the catalytic machinery required 
for biogenesis of N. In (B), the long- and short-range signaling functions of hh are 
supplied by the N and C proteins derived by internal auto-proteolysis of the U precursor. 
5 N is implicated in short-range signaling by retention near its cellular site of synthesis, 
while C is less restricted in its distribution and would execute long-range signaling 
functions. In both models, auto-proteolysis is required to generate fully active signaling 
proteins. 

FIGURES 10 C and D show an immunoblot of the N fragment synthesized from a wild 
10 type construct (C) or a consruct lacking the C donuun (D). 

FIGURES 1 1 A and B show the nucleotide and deduced amino acid sequences for partial 
human hh clones. 

FIGURE 12 A and B show in vitro cleavage reactions of a Drosophila hh protein 
produced in E, coli and purified to homogeneity. FIGURE 12, Panel A shows a time 

15 course of cleavage aflcr initiation by addition of DTT. Panel B shows incubations of 
concentrations ranging over three order of magnitude for a fixed time period (four hours), 
with no difference in the extent of conversion to the cleaved form. Panel C shows the 
sequence around the cleavage site as determined by amino-terminal sequence of the 
cleaved fragment C. The cleavage site is denoted by the arrow, and the actual residues 

20 sequenced by Edman degradation of the C fragment are underlined. Panel C also shows 
an alignment of all published vertd)rate hh sequences plus some of unpublished 
sequences from fish and Xenopus, The sequences shown correspond to the region of 
Drosophila hh where the deav^e occurs, and demonstrates the absolute conservation of 
the Gly-Cys-Phe sequence at the site of cleavage. Panel D shows a SDS-PAGE gel 

25 loaded with in vitro transcription/translation reactions as described in the previous 
Examples, using various hh genes as templates, dhh is Drosophila, twhh and zfshh are 
the twiggy-winUe and sonic hh genes of the zcbrafish, and vnshh is the shh/Hgh-I/vhh-J 
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gene of the mouse. Panel E shows that Edman degradation of the C fragments releases 
^'S counts on the first but not subsequent rounds for all these proteins, indicating that the 
site of autoproteolytic cleavage for aU of these M protdns is the amide bond to the 
amino-terminal side of the Cys residue that forms the center of the conserved Qy-Cys- 
5 Phe sequence highlighted in panel C. 

FIGURE 13 shows the predicted amino acid sequences are shown in single letter code. 
13(a) shows sequences common to five distinct /lA-like genes are shown with a dot 
indicating identity with the corresponding residue of zebrafish twiggy-winkie. 13(b) 
shows amino add sequences of twhh and sA/i are aligned to those of the soniclvhh'l class 
10 fi-om chick and mouse. The amino-terminal hydrophobic stretch common to all four hh 
genes is shaded. The asterisk (♦) denotes invariant amino acid residues associated with 
the proteolytic domain of C fragment from various species. 13(c) shows percent identity 
of residues carboxy-terminal to the hydrophobic region. 

FIGURE 14 shows a comparative expression of twhh, shh, and pax-2 during zebrafish 
15 embryogenesis. 

FIGURE 15 shows the effects of ectopic hh on zebrafish dcvelopmCTt. Wild type 
zebrafish, Danio rerio, Ekkwill Waterlife Resources) were maintained at 28.5T, some 
embryos were then cultured overnight at RT. Zebrafish embryos were injected at the 1-8 
cell stage with twhh, shh, or tocZRNA and examined at 28 h of development, (a-c) 
20 Dorsal view of the midbrain-hindbrain region; anterior is left, (a) /acZ. (b) twhh (c) shh. 
(d-f) Frontal optical section of the forebrain region; anterior is up (d) lacZ, (h) twhh, (f) 
shh. (g-1) Lateral view of the eye region; anterior is left, (g) tocZ (h) twhh. (i) twhh, 

FIGURE 16 is a table showing the effects of ectopic expression of shh, twhh and twhh 
mutants on zebrafish embryonic development. 
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FIGURE 17 shows zebrafish twiggy-winkle hedgehog derivatives. 17(a) Cartoons of 
various twhh open reading frames. SS (shaded) is the predicted N-terminal signal 
^ sequence for secretion of these proteins and encompasses the first 27 amino acids of each 
open reading frame. The arrow indicates the predicted internal ate of auto-proteolytic 

5 cleavage. Amino acid residue numbers are according to Figure 13b. The filled triangle 
denotes the normal termination codon for the twhh open reading frame. Construct Uha 
contains a mutation that blocks auto-proteolysis (the histidine at residue 273 is changed 
to an alanine; see Lee, J.J., et al., supra .V Construct U356ha contains a stop codon in 
place of amino acid residue 357 as well as the H273A mutation in Uha Construct N 

10 encodes just the first 200 amino acids of twhh. Construct C has had the codons for 
residues 31-197 deleted. 17(b) shows in vitro translation of the expression constructs 
shown schematically in part a. Constructs were translated in vitro in the presence of ^^S 
methionine and analyzed by autoradiography after SDS-PAGE. 

FIGURE 18 shows Northern blot analysis of the effect of hedgehog on expression of 
15 various neural nruu-kers. 

FIGURE 19 shows hh syner^ with naturally occurring neural markers or agents (e.g., 
XAG-1, XANF.2, Otx-A, En-2, Krox.20, Xlh box.6, NCAM. and EF-lo). 

FIGURE 20 shows Northern blot analysis of the effect of hedgehog N or C on various 
neural markers. 

20 FIGURE 21 shows AN-C interferes with X-bhh and N-activity in animal cap explants as 
shown by RT-PCR analysis. 



W09d/17924 



PCrA}S9S/lS463 



-11- 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides two novel polypeptides originally derived from a single 
precursor protein, both of \^ch have distinct structural and functional characteristics. 
The proteins are derived from a hedgehog protein and can be naturally produced by auto- 

5 proteolytic cleavage of the full-length hedgehog protein. Based on evidence provided 
herein, vMch indicates that hedgehog precursor protein and the auto-proteolytic products 
of hedgehog precursor protein are expressed in the floorplate of the ventral midline of the 
neural tube and notochord, the invention now provides a method for the induaion of 
proliferation or differentiation of neuronal cells associated with or in close proximity to 

10 the floorplate and notochord. 

In a first embodiment, the invention provides a substantially pure polypeptide character- 
ized by having an amino acid sequence derived from amino terminal amino acids of a 
hedgehog protein and having at its carboxy terminus, a glycine-cysteine-phenylalanine 
(GICF) cleavage site specifically recognized by a proteolytic activity of the carboxy 

15 terminal fragment of the native hedgehog polypeptide. This fragment is denoted the N- 
terminal fragment or polypeptide or "T^", herdn. For example, in the case of the 
Drosophila hedgehog, the N fragment includes amino acids 1-257 of hedgehog protein, 
wherein amino acids 85-257 have a molecular weight of about 19 kD by non-reducing 
SDS-PAGE (Amino acid residue numbers 1-257 include non-structural features such as 

20 signal sequences ). The G I CF cleavage site in Drosophila hedgehog precursor protein 
occurs at amino acid residues 257-259. Those, of skill in the art vyrill be able to identify 
the GICF cleavage site in otheri hedgehog genes, as the amino acid location will be 
similar and the site will be specifically recognized by the autoproteolytic activity of the 
corresponding C fragment. 

25 The N-terminal polypeptide is also characterized by being cell-associated in cells 
expressing the polypeptide m vitro, and being specifically localized in vertebrate or 
Drosophila cells or embryos, for example. In other words, this N-terminal fragment of 
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hedgehog, remains close to the she of cdlular synthesis. The association of N with the 
cell is a result of the processing event which involves lipophilic modification of the amino 
terminal domain. (See Figure IE) This modification is initiated by the action of the 
carboxy terminal domain, generating a thioester intermediate; the carboxy-terminal 
5 domain thus does not act simply as a protease, although cleavage of a peptide bond does 
uhimately result firom its action. In addition, the N firagment binds to heparin agarose in 
vitro. 

The N polypeptide of the invention is characterized by having an amino acid sequence 
derived fi-om amino terminal amino adds of hedgehog protein, e.g., 1-257 in Drosophila, 

10 wherein amino acids 1-257 have a molecular weight of about 19 kD by non-reducing 
SDS-PAGE. The N polypeptide includes smaller fi-agments which retain the fiinctional 
characteristics of fiill length N, e.g., bind to heparin. The hedgehog protein firom which 
N is derived includes, but is not limited to Drosophih, Xenopus, chicken, zebrafish, 
mouse, and human. Crystallographic analysis shows the structure of SHH-N includes 

15 the presence of a zinc ion. While: not wanting to be bound by a particular theory, the 
presence of the zinc ion is suggestive of zinc hydrolase activity. Zinc hydrolases include 
proteases such as carboxypeptidase A and thermolysin, lipases such as phospholipase C, 
and other enzymes such as carbonic anhydrase. Alterations in the zinc hydrolase site of 
the amino termirud signaling domain may be useful for modulating the range of difiusion 

20 of a hedgehog protein or to alter the signaling characteristics of the amino terminal 
signaling domain. For example, a mutation in the zinc hydrolase site may resuh in a 
tethered protein where ordinarily the protein is secreted at a distance. The result would 
be induction of a cell type not typically induced. Alteration in the zinc site may result in 
a molecule capable of inducing motor neurons and not floor plate, and vice versa. 

25 The identification of a cell-surface, or extracellular matrix localization of N and its • 
expression in notochord and floor plate-associated cells, provides a means for isolation 
or specific selection of cells expressing N, e.g. , to isolate a notochord sample or to isolate 
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floor plate ceUs. In addition, antibodies directed to N are useful for histological analysis 
of tissues suspected of expresnng N protein. 

The invention also provides a substantially pure polypeptide characterized by having an 
amino acid sequence derived from carboxy terminal amino acids of a hedgehog protein 

5 and having at its amino terminus a GICF cleavage site specifically recognized by a 
proteolytic activity of the carboxy terminal fragment of the native hedgehog polypeptide. 
This fragment is denoted the C-terminal fragment or polypeptide or T", herein. For 
example, in Drosophila this "C" polypeptide derives from the C-terminal domain of 
hedgehog precursor protein beginning at amino acid residue 258, wherein the fiill length 

10 C-tcnminal domain has a molecular weight of about 25 kD by non-reducing SDS-P AGE, 
a histidine residue at position 72, and has protease activity. The GiCF cleavage site 
specifically recognized by the proteolytic activity of the carboxy terminal fragment of the 
native hedgehog polypeptide is located at amino acid residues 257-259. As described 
above for the N firagment, now that the present invention has shown the precise cleavage 

15 recognition site for the autoproteolytic domain of hedgehog, those of skill in the art can 
readily discern the cleavage site in other hedgehog proteins thereby allowing the ready 
identification of any N or C polypeptide of any hedgehog precursor protein. 

The •'C" polypeptide of the invention is derived firom the C-terminus of a hedgehog 
precursor protein, beginning at the autoproteolytic cleavage site identified at the GCF 

20 amino acid sequence, which in Drosophila corresponds to amino acids 257-259. In 
Drosophila the histidine residue found invariably at amino acid residue 329 of the native 
hedgehog protein, and at amino add residue 72 of the C polypeptide, is essential for auto- 
proteolytic cleavage between amino acids 257 and 258 (G and C). Corresponding C- 
polypeptides of the invention will likewise contain a similarly located histidine residue 

25 \Aidi can be readily identified, such as by comparison to the Drosophila C-polypeptide. 
Among various species, the proteolytic domain can be characterized by the amino acid 
sequence -XTXXHLXX-. 
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The C polypeptide of the invention, unlike N, does not significantly bind to heparin 
agarose. C is characterized by being released into the culture supernatant of cells 
expressing C polypeptide in vitro and by being localized diflfuscly in cells and embryos. 
Because C polypeptide difiiises fi-eely, it would be detectable in various body fluids and 
5 tissues in a subject. Identification of C polypeptide expression near the niidline of the 
neural tube, as described herein, provides a usefid assay for neural tube closure in an 
embryo/fetus, for example. The presence of C polypeptide in amniotic fluid would be 
diagnostic of a disorder in which the neural tube may be malformed. 

Altered levels of C polypeptide in cerebrospinal fluid may be indicative of neuro- 
10 degenerative disorders, for example. Because C polypeptide is released from the cell 
after synthesis and autoproteolysis of native hedgehog precursor polypeptide, tumors 
synthesizing and releaai^ high levels of C polypeptide would be detectable without prior 
knowledge of the exact location of the tumor. 

C fragment is effective in inducing genes of the pituitary and anterior brain as well. In 
15 particular, induction is increased by the addition of a member of the TGF-P family of 
growth factors. For example, human activin in combination with C firagment may be 
. effective in enhancing pituitary cell growth and activity or development. 

C fragment is effective in inducing posterior markers of the brain by inhibiting N. Such 
a fiBgment is exemplified in Example 18 as AN-C. Therefore in another embodiment, the 

20 invention includes a polypeptide deleting amino acid residues 28-194 of X-bhh. 
(Autoproteolysis gives a C domain of 198-409 as well as a seven amino acid peptide, 
representing aa 24-27 and 195-197). This polypeptide blocks the activity of X-bhh and 
N in explants and reduces dorsoanterior structures in embryos. Also included are 
polynucleotide sequences encoding AN-C. AN-C is useful for increasing expression of 

25 posterior neural markers (e.g., En^2, Krox-20, Xlttbox-6) and decreasing expression of 
anterior neural markers (e.g., XANF-2. XAG-1, Otx-A) when desirable to do so to 
modulate neural patterning. 
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The term "substantially pure" as used herein refers to hedgehog N or C polypeptide 
which is substantially free of other protons, lipids, carbohydrates, nucldc adds or other 
materials vnth which it is naturaUy associated. One skilled in the art can purify hedgehog 
N or C polypeptide using standard techniques for proton purification. The substantially 
s pure polypeptide vnll yield a angle major band on a non-reducing polyacrylamide gel. 
The purity of the hedgehog N or C polypeptide can also be determined by amino-terminal 
amino acid sequence analysis. 

The invention includes a functional N or C polypeptide, and functional fragments 
thereof As used herein, the term "functional polypeptide" or "functional fragment" refers 

10 to a polypeptide which possesses a biological flmction or activity wWch is identified 
through a defined functional assi^ and which is associated with a particular biologic, 
morphologic, or phenotypic aheration in the cell. Functional fragments of the hedgehog 
N or C polypeptide include fragments of N or C polypeptide as long as the activity, e.g. . 
proteolytic activity of C polypeptide remains. Smaller peptides containing the biological 

15 activity of N or C polypeptide are therefore included in the invention. The biological 
function, for example, can vary from a polypeptide fragment as small as an epitope to 
which an antibody molecule can bind to a large polypeptide which is capable of 
paitidpating in the characteristic induction or programmmg of phenotypic changes within 
a cell. A "functional polynudeotide" denotes a polynucleotide vrtiich encodes a 

20 functional polypeptide as described herein. 

Minor modifications of the N or C polypeptide primary amino add sequence may resuh 
in polypeptides which have substantially equivalent activity as compared to the N or C 
polypeptide described herein. Such modifications may be deliberate, as by site-directed 
mutagenesis, or may be spontaneous. All of the polypeptides produced by these 
2S modificati(His arc induded herein as long as the proteolytic activity of C polypeptide, for 
example, is presem. Further, ddetion of one or more amino adds can also result in a 
modification of the structure of the resultant molecule without significantly altering its 
activity. This can lead to the development of a smaller active molecule which would have 
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broader utility. For example, it is possible to remove amino or caiboxy terminal amino 
acids vAnch may not be required for N or C polypeptide activity. 

The N or C polypeptide of the invention also includes conservative variations of the 
polypeptide sequence. The term "conservative variation" as used herein denotes the 

5 replacement of an amino acid residue by another biologically similar residue. Examples 
of conservative variations include the substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, or the substitution of one polar 
residue for another, such as the substitution of arginine for lysine, glutamic for aspartic 
acids, or glutamine for asparagine, and the like. The terra "conservative variation" also 

10 includes the use of a substituted amino add in place of an unsubstituted parent amino acid 
provided that antibodies raised to the substituted polypeptide also inununoreact with the 
unsubstituted polypeptide. 

The N fiagmmt of the invention includes both the active form of the polypeptide and the 
N fragment including the uncleaved signal sequence. For example, in Drosophila where 
15 the signal sequence is internal (at about amino acids 60-80), the entire uncleaved N 
fragment beginning at the initiating methionine is inchided in the invention. Those of 
skill in the art can readily ascertain the nature and location of the signal sequence by 
using, for example, the algorithm described in von Heijne, G., Nuci Acids Res. 14:4683, 
(1986). 

20 The invention also provides an isolated polynucleotide sequence consisting essentially 
of a polynucleotide sequence encoding a polypeptide having the amino acid sequence of 
N or C polypeptide of the invention. The term "isolated" as used herein includes poly- 
nucleotides substantially free of other nucleic acids, proteins, lipids, carbohydrates or 
other materials with which it is naturally associated. Polynucleotide sequences of the 

25 invention include DNA, cDNA and RNA sequences which encode N or C polypeptide. 
It is understood that all polynucleotides encoding all or a portion of N or C polypeptide 
are also included herein, as long as they encode a polypeptide with N or C polypeptide 
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activity. Such polynucleotides include naturally occuning, synthetic, and intentionally 
manipulated polynucleotides. For example, N or C polypeptide polynucleotide may be 
subjected to site*directed mutageneas. The polynucleotide sequence for N or C 
polypeptide also includes antisense sequences. The polynucleotides of the invention 

5 inchide sequences that are degenerate as a resuh of the genetic code. There are 20 natural 
amino acids, most of which are specified by more than one codon. Therefore, all 
degenerate nucleotide sequences are included in the invention as long as the amino acid 
sequence of N or C polypeptide polypeptide encoded by the nucleotide sequence is 
functionally unchanged. In addition, the invention also includes a polynucleotide 

10 consisting essentially of a polynucleotide sequence encoding a polypeptide having an 
amino acid sequence of N or C and having at least one epitope for an antibody 
inmiunoreactive with N or C polypeptide. 

The polynucleotide encoding N or C polypeptide inchides the entire polypeptide or 
fi^gments thereof, as well as nucleic acid sequences complementary to that sequence. A 

15 complementary sequence may inchide an antisense nucleotide. When the sequence is 
RNA, the deoxynucleotides A, G, C, and T are replaced by ribonucleotides A, G, C, and 
U, respectively. Also included in the invention are firagments of the above-described 
nucleic add sequences that are at: least IS bases in length, which is sufficient to permit 
the fragment to selectively hybridize to DNA that encodes the protein under physiologi- 

20 cal conditions. Preferably, the fragments hybridize under stringent conditions. 

DNA sequences of the invention can be obtained by several methods. For example, the 
DNA can be isolated using hybridization techniques which are well known in the art. 
These inchide, but are not limited to: 1) hybridization of genomic or cDNA libraries with 
probes to detect homologous nucleotide sequences; 2) antibody screening of expression 
25 libraries to detect cloned DNA fragments with shared structural features; and 3) PCR 
amplification of a desired nucleotide sequence using oligonucleotide primers. 
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Prefcrably the hedgehog, N, or C poiymicleotide of the invention is derived from a 
vettebrate organism, and most preferably from human. So-eming procedures which rely 
on nucleic acid hybridization make it possible to isolate any gene sequence from any 
organism, provided the appropriate probe is available. Oligonucleotide probes, which 

5 correspond to a part of the sequence encoding the protein in question, can be synthesized 
chemically. This requires that short, oligopeptide stretches of amino acid sequence must 
be known. The DNA sequence encoding the protein can be deduced from the genetic 
code, however, the degeneracy of the code must be taken into account. It is possible to 
perform a mixed addition reaction when the sequence is degenerate. This includes a 

10 heterogeneous mixture of denatured double-stranded DNA. For such screening, 
hybridization is preferably performed on either single-stranded DNA or denatured 
double-stranded DNA. Hybridization is particularly useful in the detection of cDNA 
clones derived from sources where an extremely low amount of mRNA sequences 
relating to the polypeptide of interest are present. In other words, by using stringent 

15 hybridization conditions directed to avoid non-specific binding, it is possible, for 
example, to allow the autoradiographic visualization of a specific cDNA clone by the 
hybridization of the target DNA to that single probe in the mixture which is its complete 
complement (Wallace, ei aL. NucL Acid Res., 2:879, 1981). 

The development of specific DNA sequmces encoding hedgehog can also be obtained 
20 by: 1) isolation of double-stranded DNA sequences from the genomic DNA; 2) chemical 
manufacture of a DNA sequence to provide the necessary codons for the polypeptide of 
interest; and 3) in vitro synthesis of a double-stranded DNA sequence by reverse 
transcription of mRNA isolated from a eukaryotic donor cell. In the latter case, a double- 
stranded DNA complement of mRNA is eventually formed which is generally referred 
25 to as cDNA. 

• 

Of the three above-noted methods for developing specific DNA sequences for use in 
recombinant procedures, the isolation of genomic DNA isolates is the least common. 
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This is especially true when it is desirable to obtain the microbial expression of 
mammalian polypeptides due to the presence of introns. 

The synthesis of DNA sequences is frequently the method of choice when the entire 
sequence of amino acid residues of the desired polypeptide product is known. When the 

5 entire sequence of amino add residues of the desired polypeptide is not known, the direct 
synthesis of DNA sequences is not possible and the method of choice is the synthesis of 
cDNA sequences. Among the standard procedures for isolating cDNA sequences of 
interest is the formation of plasmid- or phage-canying cDNA libraries which are derived 
from reverse transcription of mRNA which is abundant in donor cells that have a high 

10 level of genetic expression. When used in combination with polymerase chain reaction 
technology, even rare expression products can be cloned. In those cases where significant 
portions of the amino acid sequence of the polypeptide are known, the production of 
labeled angle or double-stranded DNA or RNA probe sequences duplicating a sequence 
putatively present in the target cDNA may be employed in DNA/DNA hybridization 

15 procedures which are carried out on cloned copies of the cDNA which have been 
denatured into a single-stranded form (Jay, e( aL, NucL Acid Res., 11:2325, 1983). 

A preferred method for obtaining genomic DNA, for example, is Polymerase Chain 
Reaction (PCR), which relies on an in vitro method of nucleic acid synthesis by which 
a particular segment of DNA is specifically replicated. Two oligonucleotide primers that 

20 flank the DNA fragment to be amplified are utilized in repeated cycles of heat 
denaturation of the DNA, annealing of the primers to their complementary sequences, and 
extension of the annealed primers with DNA polymerase. These primers hybridize to 
opposite strands of the target sequence and are oriented so that DNA synthesis by the 
polymerase proceeds across the rpgion between the primers. Since the extension products 

25 themselves are also complementary to and capable of binding primers, successive cycles 
of amplification essentially double the amount of the target DNA synthesized in the 
previous cyde. The resuk is an exponential accumulation of the specific target fragment, 
approximately 2", where n is the number of cycles of amplification performed (see PCR 
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Protocols, Eds. Innis, et aL, Academic Press, Inc., 1990, incoiporated herein by 
reference). 

A cDNA expression library, such as Xgtl 1, can be screened indirectly for hedgehog, N, 
or C polypeptides having at least one epitope, using antibodies specific for hedgehog, N, 
5 or C. Such antibodies can be either polyclonally or monoclonally derived and used to 
detect expression product indicative of the presence of the desired hedgehog cDNA. 

The polynucleotide sequence for hedgehog, N, or C, also includes sequences 
complonentary to the polynucleotide encoding hedgehog, N or C (antisense sequences). 
Antisense nucleic acids are DNA or RNA molecules that are complementary to at least 

10 a portion of a specific mRNA molecule (Weintraub, Scientific American, 262:40, 1 990). 
The invention embraces all antisense polynucleotides capable of inhibiting production of 
hedgehog, N, or C polypeptide. In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA, forming a double*stranded molecule. The antisense nucleic acids 
interfere with the translation of the mRNA since the cell will not translate a mRNA that 

15 is double-stranded. Andsense oligomers of about 1 S nucleotides are preferred, since they 
are easily synthesized and are less likely to cause problems than larger molecules when 
introduced into the target hedgehog, N, or C-producing cell. The use of antisense 
methods to inhibit the translation of genes is well known in the art (Marcus-Sakura, Anal 
Biochem., 172:289, 1988). Inhibition of target nucleotide would be desirable, for 

20 example, in inhibiting cell-proliferative disorders, such as certain tumors, which are 
mediated by hedgehog, N or C. 

In addition, ribozyme nucleotide sequences for hedgehog, N or C are included in the 
invention. Ribozymes are RNA molecules possessing the ability to specifically cleave 
other single-stranded RNA in a manner analogous to DNA restriction endonucleases. 
25 Through the modification of nucleotide sequences which encode these RNAs, it is 
possible to engineer molecules that recognize specific nucleotide sequences in an RNA 
molecule and cleave it (Cech, J. Amer. Med, Asm., 2^:3030, 1988). A major advantage 
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of this approach is that, because they are sequence-specific, only inRNAs with particular 
sequences are inactivated. 

There are two basic types of ribozymes namely, tetrahymena-type (HasselhofF, Nature, 
114:585, 1988) and "hammerhead"-type. Tetrahymena-type ribozymes recognize 

5 sequences vAnch are four bases in length, while "hammerhead"-type ribozymes recognize 
base sequences 11-18 bases in length. The longer the recognition sequence, the greater 
the likelihood that sequence will occur exclusively in the target mRNA species. Conse- 
quently, hammerhead-type ribozymes are preferable to ietrahymena-type ribozymes for 
inactivating a specific mRNA spedes and IS-based recognition sequences are preferable 

10 to shorter recognition sequences. 

DNA sequences encoding hedgehog, N or C can be expressed in vitro by DNA transfer 
into a suitable host cell "Host cdls" are cells in which a vector can be propagated and 
its DNA expressed. The term also includes any progeny of the subject host cell. It is 
understood that all progeny may not be identical to the parental cell since there may be 
15 mutations that occur during replication. However, such progeny are included when the 
term "host cell" is used. Methods of stable transfer, meaning that the foreign DNA is 
continuously maintained in the host, are known in the art. 

In the present invention, the hedgehog, N or C polynucleotide sequences may be inserted 
into a recombinant expression vector. The term "recombinant expression vector" refers 

20 to a plasmid, virus or other vehicle known in the art that has been manipulated by 
insertion or incorporation of the hedgehog, N or C genetic sequences. Such expression 
vectors contain a promoter sequence which facilitates the eflBcient transcription of the 
inserted genetic sequence of the host. The expression vector typically contains an origin 
of replication, a promoter, as well as specific genes which allow phenotypic selection of 

25 the transformed cells. Vectors suitable for use in the present invention include, but are 
not limited to the T7-based expression vector for expression in bacteria (Rosenberg, el 
al.. Gene, 56:125, 1987), the pMSXND expression vector for expression in mammalian 
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cdls (Lee and Nathans* 1 Biol Chem,, 2^:3521, 1988) and baculoviius-derived vectors 
for expression in insect cells. The DNA segment can be present in the veaor operably 
linked to regulatory elements, for example, a promoter (e.g., T7, metallothionein I, or 
polyhedrin promoters). 

5 Polynucleotide sequences encoding hedgehog, N or C can be expressed in either 
prokaryotes or eukaryotes, although post-translational modification of eukaryotically 
derived potypq>tides, such as caiboxylation, would occur in a eukaryotic host. Hosts can 
include microbial, yeast, insect and mammalian organisms. Methods of expressing DNA 
sequences having eukaryotic or viral sequences in prokaryotes are well known in the art. 

10 Biologically functional viral and plasmid DNA vectors capable of expression and 
replication in a host are known in the art. Such vectors are used to incorporate DNA 
sequences of the invention. 

Methods which are well known to those skilled in the art can be used to construct 
expression vectors containing the hedgehog, N or C coding sequence and appropriate 
15 transoiptional/translational control signals. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and in vivo recombination/genetic techniques. 
See, for example, the techniques described in Maniatis, ei al, 1989 Molecular Cloning 
A Laboratory Manual, Cold Spring Harbor Laboratory, N, Y. 

A variety of host-expression vector systems may be utilized to express the hedgehog, N 
20 or C coding sequence. These include but are not limited to microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors containing the hedgehog, N or C coding sequence; yeast 
transformed with recombinant yeast expression vectors containing the hedgehog, N or C 
coding sequence; plant cell systems infected with recombinant virus expression vectors 
25 (e.g., cauliflower mosaic vims, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors {e.g., Ti plasmid) containing the Hedgehog, N 
or C coding sequence; insect cell systems infected with recombinant virus expression 
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vectors (e.g., baculovirus) containing the hedgehog, N or C coding sequence; or animal 
cell systems infeaed with recombinant virus expression vectors (e.g.. retroviruses, 
adenovirus, vaccinia virus) containing the hedgehog, N or C coding sequence, or 
transformed animal cell systems engineered for stable expression. 

5 Depending on the host/vector system utilized, any of a number of suitable transcription 
and translation elements, including constitutive and inducible promoters, transcription 
enhancer elements, transcription terminators, etc., may be used in the expression vector 
(see e.g.. Bitter, e/ a/, 1987, Methods in Enzymology, 153:516-544). For example, when 
cloning in bacterial systems, inducible promoters such as pL of bacteriophage y, plac, 

10 ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. When cloning in 
mammalian cell systems, promoters derived from the genome of mammalian cells (e.g., 
metallothionein promoter) or from mammalian viruses (e.g., the retrovirus long terminal 
repeat; the adenovirus latie promoter; the vaccinia virus 7.5K promoter) may be used. 
Promoters produced by recombinant DNA or synthetic techniques may also be used to 

15 provide for transcription of the inserted hedgehog, N or C coding sequence. 

In bacterial systems a number of expression vectors may be advantageously selected 
depending upon the use intended for the expressed. For example, when large quantities 
of hedgehog, N or C are to be produced, vectors which direct the expression of high 
levels of fusion protein products that are readily purified may be desirable. Those which 

20 are engineered to contain a deavage site to aid in recovering are preferred. Such vectors 
inchide but are not limited to the E, coli expression vector pUR278 (Ruther, et al , EMBO 
J., 2:1791, 1983), in which the Hedgehog, N or C coding sequence may be ligated into 
the vector in frame with the lac Z coding region so that a hybrid -lac Z protein is 
produced; pIN vectors (Inouye and Inouye, Nucleic Acids Res , Jl:3101, 1985; Van 

25 Heeke and Schuster, J. Biol. Chem. 264:5503, 1989) and the like. 

In yeast, a number of vectors containing constitutive or inducible promoters may be used. 
For a review see, Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel, ei 
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o/., Greene Publish. Assoc. & MTUey Intersdence. Ch. 13; Grant. etaJ., 1987, Expression 
and Secretion Vectors for Yeast, m Methods in Erotology, Eds. Wu and Grossman, 
31987, Acad. Press, N.Y., Vol. 153, pp.516-544; Glover. 1986, DNA Cloning, Vol. II, 
IRL Pre». Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous Gene Expression in Yeast, 
Methods in Enzymology, Eds. Berger and Kimmel. Acad. Press, N.Y., Vol. 152. pp. 673- 
684; and The Molecular Biology of the Yeast Saccharomyces. 1982. Eds. Strathem, et 
al. Cold Spring Harbor Press, Vols. I and n. A constitutive yeast promoter such as ADH 
or LEU2 or an inducible promoter such as GAL may be used (Cloning in Yeast, Ch. 3, 
R. RothsteinIn: DNA Cloning Vol.11, A Practical Approach, Ed. DM Glover, 1986, IRL 
Press, Wash., D.C.). Alternatively, vectors may be used which promote integration of 
foreign DNA sequences into the yeast chromosome. 

In cases where plant expression vectors are used, the expression of the hedgehog, N or 
C coding sequence may be driven by any of a nund>a- of promoters. For example, viral 
promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson. et al.. 
Nature, 2ifi:51 1, 1984), or the coat protein promoter to TMV (Takamatsu, et al., EMBO 
J., 6:307, 1987) may be used; alternatively, plant promoters such as the small subunit of 
RUBISCO (Coruzzi, et al., EMBO J., 1:1671-1680, 1984; Broglie, et al.. Science. 
^:838, 1984); or heat shock promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley, 
et al., Mol. Cell. Biol., 6:559, 1986) may be used. These constructs can be introduced 
into plant cells using Ti plasmids, Ri plasmids. plant virus vectors, direct DNA 
transformation, microinjection, electroporation, etc. For reviews of such techniques see, 
for example. Weissbach and Weissbach, 1988, Methods for Plant Molecular Biology, 
Academic Press, NY, Section VIIl, pp. 421-463; and Grierson and Corey, 1988, Plant 
Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9. 

An alternative expression system which could be used to express is an insect system. In 
one sudi sysxem, Autographa califomica nuclear polyhedrosis virus (AcNPV) is used as 
a vector to express forngn genes. The virus grows in Spodopterafrugiperda cells. The 
hedgehog, N or C coding sequence may be cloned into non-essential regions (for ocample 
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the polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for 
example the polyhedrin promoter). Successful insertion of the hedgehog, N or C coding 
sequence will result in inactivation of the polyhedrin gene and production of non- 
occluded recombinant virus (i.e., virus lacking the protebiaceous coat coded for by the 
s polyhedrin gene). These recombinant viruses are then used to infect Spodoptera 
frugiperda cdb in which the inserted gene is expressed, (e.g., see Smith, et al, J. Viol ., 
4fi:584, 1983; Smith, U.S. Patent No. 4,215,051). 

Eukaryotic systems, and preferably mammalian expression systems, allow for proper 
post-translational modifications of expressed mammalian proteins to occur. Eukaryotic 
10 cdls whidi possess the cdhilar machinery for proper processing of the primary transcript, 
glycosylation, phosphorylation, and advantageously, secretion of the gene product may 
be used as host cells for the expression of hedgehog, N or C. Mammalian cell lines may 
be preferable. Such host odl lines nuy inchide but are not limited to CHO, VERO, BHK, 
HeLa, COS, MDCK, -293, and WD8. 

15 Mammalian cell systems which utilize recombinant viruses or viral elements to direct 
expression may be engineered. For example, when using adenovirus expression vectors, 
the hedgehog, N or C coding sequence may be ligated to an adenovirus transcription/- 
translation control complex, e.g., the late promoter and tripartite leader sequence. This 
chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo 

20 recombination. Insertion in a non-essential region of the viral genome (e.g., region El 
or E3) will result in a recombinant virus that is viable and capable of expressing the 
protein in infected hosts (e.g., see Logan and Shenk, Proc. Natl. Acad Sci. USA, 813655, 
1984). Ahemativdy, the vaccinia virus 7.5K promoter may be used, (e.g., see, Mackett, 
et al., Proc. Natl. Acad Sci. USA\ 22;7415, 1982; Mackett, et al., J. Virol., 49: 857. 

25 1984; Panicali, etal., Proc. Natl. Acad Sci. USA, 22:4927, 1982). Of particular interest 
are vectors based on bovine papilloma virus which have the ability to rq>licate as 
extrachromosomal elements (Sarver, etal., Mol. Cell. Biol, 1:486, 1981). Shortly after 
entry of this DNA imo mouse cells, the plasmid replicates to about 1 00 to 200 copies per 
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cell. Transoiption of the inserted cDNA does not require integration of the plasmid into 
the host's chromosome, thereby yielding a high level of expression. These vectors can 
be used for stable expression by inchiding a selKtable marka* in the plasmid, such as, for 
example, the gene. Ahematively, the retroviral genome can be modified for use as 
5 a vector capable of introducing and directing the expression of the hedgehog, N or C gene 
in host cells (Cone and Mulligan, Proc. Natl Acad. Set USA, £1:6349, 1984). High level 
expression may also be achieved using inducible promoters, including, but not limited to, 
the metallothionine IIA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 

10 prefored. Rather than using expression vectors which contain viral origins of replication, 
host cells can be transformed with the hedgehog, N or C cDNA controlled by appropriate 
expression control elements (e.g., promoter, enhancer, sequences, transcription 
terminators, polyadenylation sites, etc.), and a selectable marker. The selectable marker 
in the recombinant plasmid confers resistance to the selection and allows cells to stably 

15 integrate the plasmid into their chromosomes and grow to form foci which in turn can be 
cloned and expanded into cell lines. For example, following the introduction of foreign 
DNA, engineered cdls may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. A number of selection systems may be used, 
including but not limited to the herpes simplex virus thymidine kinase (Wigler, ei ai, 

20 Cell, 11: 223, 1977), hypoxanthine-guanine phosphoribosyltransferase (Szybalska and 
Szybalski, Proc, Natl Acad ScL USA. 48:2026, 1962), and adenine phosphoribosyU 
transferase (Lowy, et al. Cell, 22: 817, 1980) genes can be employed in tk', hgprt' or 
aprt* cells respectively. Also, antimetabolite resistance can be used as the basis of 
selection for dhfi-, which confers resistance to methotrexate (Wigler, et ai, Natl Acad, 

25 ScL USA, 22: 3567, 1980; OWare, et al, Proc. Natl Acad Set USA, 78: 1527, 1981); 
gpt, which confers resistance to mycophenolic acid (Mulligan and Berg, Proc, Natl Acad, 
Sci. USA, 2S: 2072, 1981; neo, which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin, e/a/„ J. Mol Biol, 150:1, 1981); and hygro, which confers resistance 
to hygromycin (Santerre. et al. Gene, 20:147, 1984) genes. Recently, additional 
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selectable genes have been described, namely trpB, v/tich allows cdls to utilize indole 
in place of tryptophan; hisD, ^ch allows cells to utilize histinol in place of histidine 
(Haitman and MuDigan, Proc NatL Acad Set USA, M:8047, 1988); and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine decarboxylase inhibitor, 2- 
(difluoromethyl>DL-omithine, DFMO (McConlogue L., 1987, In: Current Conimunica- 
tions in Molecular Biology, Cold Spring Harbor Laboratory ed.). 

Transformation of a host cell with recombinant DNA may be carried out by conventional 
techniques as are well known to those skilled in the art. Where the host is prokaryotic. 
such as £. coli, competent cells which are capable of DNA uptake can be prepared from 
cells harvested after exponential growth phase and subsequently treated by the CaCla 
method using procedures well known in the art. Alternatively, MgCl2 or RbCl can be 
used. Transformation can also be performed after fornung a protoplast of the host cell 
if desired. 

When the host is a eukaiyote, sudi methods of transfection of DNA as calcium phosphate 
co-precipitates, conventional mechanical procedures such as microinjection, electro- 
poration, insertion of a plasmid encased in liposomes, or virus vectors may be used. 
Eukaiyotic cells can also be cotransformed with DNA sequences encoding the hedgehog, 
N or C of the invention, and a second foreign DNA molecule encoding a selectable 
phenotype, such as the herpes simplex thymidine kinase gene. Another method is to use 
a eukaiyotic viral vector, such as simian virus 40 (SV40) or bovine papilloma virus, to 
transiently infect or transform eukaiyotic cells and express the protein, (see for example, 
Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Ghizman ed., 1982). 

Isolation and purification of microbial expressed polypeptide, or fragments thereof, 
provided by the invention, may be carried out by conventional means including 
prq)arative chromatography and immunological separations involving monoclonal or 
polyclonal antibodies. 
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The invention includes antibodira inuminoreactive with or which bind to hedgehog, N or 
C polypeptide or functional fragments thereof. Antibody ^ch consists essentially of 
pooled monodonal antibodies with different epitopic specificities, as well as distinct 
monoclonal antibody preparations are provided. Monoclonal antibodies are made from 

5 antigen containing fragments of the protein by methods well known to those skilled in the 
art (Kohler, et al, Nature, 256:495, 1975). The tem antibody as used in this invention 
is meant to include intact molecules as well as fragments thereof, such as Fab and F(ab')2, 
which are a^)able of binding an epitopic determinant on hedgehog, N or C. The 
antibodies of the invention include antibodies which bind to the N or C polypeptide and 

10 which bind with immunoreactive fragments N or C. 

The term "antibody" as used in this invention includes intact molecules as well as 
fragments thereof, such as Fab, FCab^, and Fv which are aq>able of binding the epitopic 
determinant. These amibody fragments retain some ability to selectively bind with its 
antigen or receptor and are defined as follows: 

15 (1) Fab, the fiagmcnt which contains a monovalent antigen-binding fr^ent of an 
antibody molecule can be produced by digestion of whole antibody with the 
enzyme papain to yield an intact light chain and a portion of one heavy chain; 

(2) Fab\ the fragment of an antibody molecule can be obtained by treating whole 
antibody with pepsin, followed by reduction, to yield an intact light chain and a 

20 portion of the heavy chain; two Fab' fragments are obtained per antibody 

molecule; 

(3) (Fab')2, the fragment of the antibody that can be obtained by treating whole 
amibody with Ae enzyme pepsin without subsequent reduction; F(ab*)2 is a dimer 
of two Fab' fragments held together by two disulfide bonds; 
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(4) Fv, defined as a genetically engineered fragment containing the variable 
genetically fused single chain molecule. 

Methods of making these fiagments are known in the ait. (See for example, Hariow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, New York 
5 (1 988), incorporated herein by reference). 

As used in this invention, the term "epitope" means any antigenic determinant on an 
antigen to which the paratope of an antibody binds. Epitopic determinants usually consist 
of chemically active surface groupings of molecules such as amino acids or sugar side 
chains and usually have specific three dimensional structural characteristics, as well as 
10 specific charge characteristics. 

Antibodies which bind to the hedgdiog, N or C polypeptide of the invention can be 
prepared u»ng an intact polypeptide or firagments containing snudl peptides of interest 
as the immunizing antigen. The polypeptide such as N or C, or firagments thereof used 
to immunize an animal can be derived fi'om translated cDNA or chemical synthesis which 
15 can be corijugated to a carrier protein, if desired. Such commonly used carriers which are 
chemically coupled to the peptide include keyhole limpet hemocyanin (KLH), 
thyrogiobulin, bovine serum albumin (BSA), and tetanus toxoid. The coupled peptide is 
then used to immunize the animal (e.g., a mouse, a rat, or a rabbit). 

If desired, polyclonal or monoclonal antibodies can be fiirther purified, for example, by 
20 binding to and elution from a matrix to which the polypeptide or a peptide to which the 
antibodies were raised is bound. Those of skill in the art will know of various techniques 
common in the immunology arts for purification and/or concentration of polyclonal 
antibodies, as well as monoclonal antibodies (See for example, Coligan, et ai. Unit 9, 
Current Protocols in Immurtology, Wiley Interscience, 1991. incorporated by reference). 
25 It is also possible to use the anti-idiotype technology to produce monoclonal antibodies 
which mimic an epitope. For example, an anti-idiotypic monoclonal antibody made to 
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a first monoclonal antibody ivill have a binding domain in the hypervariable region which 
is the "image" of the epitope bound by the first monoclonal antibody. 

Antibodies as described herein as having specificity for N polypeptide, e.g,, Abl 
(residues 83-160), are useful for specific identification of cells or tissues expressing the 
5 N fi-agment of hedgehog. Similarly, antibodies described herein as having specificity for 
C polypeptide, e.g., Ab2 (residues 300-391), are useful for specific identification of cells 
or tissues expressing the C fi-agment of hedgehog. Both antibodies, naturally, will also 
detect native hedgehog polypeptide. 

The N and C-specific antibodies of the invention are usefijl for purification of N and C 
10 polypeptide, respectively, especially using the antibodies 'immobilized on solid phase. 
By contacting a sample with anti-N antibody, both N and native hedgehog polypeptides 
can be isolated. By next contacting the sample removed by anti-N antibodies, with anti-C 
antibodies, the native hedgehog polypeptide is removed, thus allowing purification of N 
polypeptide. In a similar manner, C polypeptide can be antibody purified fi-om a sample. 

15 Monoclonal antibodies of the invention are suited for use, for example, in immunoassays 
in which they can be utilized in liquid phase or bound to a solid phase carrier. In 
addition, the monoclonal antibodies in these immunoassays can be detectably labeled in 
various ways. Examples of types of inununoassays which can utilize monoclonal 
antibodies of the invention are competitive and non-competitive immunoassays in either 

20 a direct or indirect format. Examples of such immunoassays are the radioimmunoassay 
(RIA) and the sandwich (immunometric) assay. Detection of the antigens using the 
monoclonal antibodies of the invention can be done utilizing immunoassays which are 
run in either the forward, reverse, or simultaneous modes, including immunohistochemi- 
cal assays on physiological samples. Those of skill in the art will know, or can readily 

25 discern, other inununoassay formats without undue experimenution. 
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The temi "immunometric assay" or "sandwich immunoassay", includes simultaneous 
sandvnch, forward sandwich and reverse sandwich immunoassays. These terms are well 
understood by those skilled in the art. Those of skill will also appreciate that antibodies 
according to the present invention will be useful in other variations and forms of assays 
5 which are presently known or which may be developed in the future. These are intended 
to be included within the scope of the presem invention. 

Monoclonal antibodies can be bound to many different carriers and used to detect the 
presence of N or C polypeptide. Examples of well-known carriers include glass, 
polystyrene, polypropylene, polyethyloie, dextran, nylon, amylases, natural and modified 
10 celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier can be 
either soluble or insoluble for purposes of the invention. Those skilled in the art will 
know of other suitable carriers for binding monoclonal antibodies, or will be able to 
ascertain sudi using routine experimentation. 

For purposes of the invention, N or C polypeptide may be detected by the monoclonal 
15 antibodies when present in biological fluids and tissues. Any sample containing a 
d^ectable amount of N or C can be used. A sample can be a liquid such as urine» saliva, 
cerebrospinal fluid, blood, serum and the like, or a solid or semi-solid such as tissues, 
feces, and the like, or, alternatively, a solid tissue such as those commonly used in 
histological diagnosis, C polypeptide in particular is detectable in biological samples, 
20 since it tends to difiiise more readily than N polypeptide. 

In performing the assays it may be desirable to include certain "blockers" in the 
incubation medium (usually added with the labeled soluble antibody). The "blockers" are 
added to assure that non-specific proteins, proteases, or anti-heterophilic immuno- 
globulins to anti-C or N immunoglobulins present in the experimental sample do not 
25 cross-link or destroy the antibodies on the solid phase support, or the radiolabeled 
indicator antibody, to yield false positive or false negative results. The selection of 
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l)]odcers*' therefore add subsumtially to the specificity of the assays described in the 
present invention. 

The invention also pro^des a method for modulating proliferation or differentiation of 
neuronal cells comprising contacting the cells vath a hedgehog polypeptide. The 

5 hedgehog polypeptide may be a native hedgehog polypeptide, or a N or C polypeptide, 
or functional fragments thereof Preferably, the modulation is induction of proliferation 
or differentiation of a particular cell type. This can involve either synergistic positive 
induction of neuronal cells by N, or negative modulation by delta N-C for example (Lai, 
etal. Development ]21.1^A9, 1995). Delta N-C enhances expression of posterior relative 

10 to anterior neural genes and does so through inhibition of N (see EXAMPLE 18 and 
Figure 18D). In addition to hedgehog polypeptide, a TGF-P factor may also be utilized 
in the method of the invention. 

Previous studies with the rat hedgjsshog gene showed that co-culture of cells expressing 
rat hedgehog precursor gene, with explant from neural tube, was sufficient to induce 

15 formation of motor neurons and floor plate from the explant (Jessesl, T., and Dodd, J., In 
Cell-Cell Signaling in Vertebrate Development (ed. E.J. Robertson, et a/., pp 139-155, 
San Diego, Ca.), 1993). Therefore, based on the Examples herein showing that hedgehog 
is expressed near the floorplate of the ventral midline of the neural tube and notochord, 
neuronal cells substantially derived from floor plate neuronal cells can be induced by 

20 contacting the cells with hedgehog, N or C polypeptide. As used herein, the term 
"substantially derived", refers to those cells from the floor plate or proximate to the floor 
plate. For example, such cells include motor neurons and dopaminergic neurons. Those 
of skill in the art will be able to identify other neuronal cells substantially derived from 
the floor plate. Preferably the cells are vertebrate cells and most preferably, human cells. 

25 In addition, as described herein in the Examples, hedgehog, and particularly C fragment, 
induces the expresaon of pituitary ^es. Hedgehog is also effective in inducing anterior 
brain gene expression as exemplified by the OTX-A marker. Further, the addition of a 
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TGP'^ hnafy member, for example acthrin, may be used to further induce expression of 
such genes. Other TCH^-P&milymanba'svvin be known to those of skill in the art. This 
apparent syneigy of fragments with TGF-p fiunily members occurs through the TGF-P 
protein inducing expression of neural inducers such as noggin and foUistatin. The hh 
fragment then synergizes with these inducers to pattern neural gene expression. 

hh fragmems may also be useful as nerve-sparing agents or in restoring or promoting 
appropriate patterning during the healing of major limb trauma. In addition, the N and 
C fragments may be useful in the area of genetic counseling. Specifically, familial 
midline defects such as cyclopia, Polydactyly or neural tube defects may be diagnosed 
by mapping close to hh. Since autoproteolytic defects may be responsible for the 
disorders, N or C therapy could be provided. 

The invention also provides an uitoproteolytic fiiMon protein comprising a first 
polypeptide inchiding the proteolytic domain of the C polypeptide of the invention, a 
cleavage »te recognized by the first pofypeptide, and a second polypqitide. (It is 
understood that the first and second polypeptides can be reversed.) The auto-proteolytic 
activity of the native hedgehog protdn is found entirely within the C polypeptide, 
therefore, the C polypeptide is usefiil for producing a fiision polypeptide which can then 
be cleaved at the junction of the C polypeptide and the second polypeptide. The fiision 
protein may optionally have a purification tag, such as a poly-histidine tag for isolation 
on a nickel column, or an antibody epitope tag, preferably on the C fi-agment. The 
cleavage site includes the sequence "GCF", which is recognized by the proteolytic 
domain of the C polypeptide and is utilized to cleave the second polypeptide fi^om the C 
fingment. Also included in the invention is a polynucleotide encoding the fiision protein 
of the invention. 

The invention also provides a method for produdng an autoproteolytic fiision protein 
comprising operably linking a first polynucleotide, wherein the first polynucleotide 
encodes a first polypeptide including the proteolytic domain of the C polypeptide of the 
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invention and the cleavage site recognized by the proteolytic donuun, and a second 
polynucleotide encoding a second polypq)tide. As described above, the fusion protein 
may also include a carrier peptide and/or a purification tag. 

The C polypeptide or functional fi-agment thereof is useful as a fusion partner to cause 
lipophilic modification and tethering of other proteins in vivo or in vitro. Such fusion 
proteins may be desirable for factors whose activity is required in a localized manner, 
either by targeting DNA constructs to specific cells or by introducing cells transfected 
with specific DNA constructs, for example. It may be desirable to lipid-modify a 
normally secreted protein in order to produce a cell-associated protein. For example, it 
may be desirable to produce a viral antigen that remains cell associated. 

Alternatively, the C polypeptide or fimctional fragments thereof can be used as a fusion 
partner with a protein of interest (e.g., Protein X fiised to hh-C domain). Such fusions 
form thiocsters at the junction between Protein X and hh-C (via an S to N shift). The 
thioesters are then available as substrates for a peptide ligation reaction in which any 
peptide or protein having an amino terminal cysteine (Peptide Y) is added and undergoes 
spontaneous rearrangement (S to N shift) that generates a stable peptide bond between 
Protein X and Peptide Y (Protein X-peptide bond-Peptide Y), For example, a protein 
that is toxic when produced in vivo could be produced in vitro using the hh-C domain 
fusion protein method. 

The fusion polypeptide may also include an optional carrier peptide. The "carrier 
peptide", or signal sequence, is located at the amino terminal end of the fusion peptide 
sequence. In the case of eukaryotes, the carrier peptide is believed to function to 
transport the fiiaon polypeptide across the endoplasmic reticulum. The secretory protein 
is then transported through the Golgi apparatus, into secretory vesicles and into the 
extracellular space or, preferably, the external environment. Carrier peptides which can 
be utilized according to the invention include pre-pro peptides which contain a proteolytic 
enzyme recognition site. Acceptable carrier peptides include the amino terminal pro- 
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r^on of calcitonin or other hormones, vAnch undergo cleavage at the flanking dibasic 
sites. However, it should be noted that the invention is not limited to the use of any 
particular pqrtide as a carrier. Other carrier peptides are known to those skilled in the art 
or can be readily ascertained without undue experimentation. 

5 In one embodiment of the invention, a carrier peptide which is a signal sequence is 
included in the expression vector, specifically located adjacent to the N-tenninal end of 
the fusion polypeptide. This signal sequence allows the fusion protein to be directed 
toward the endoplasmic reticulum. Typically, the signal sequence consists of a leader of 
fi-om about 16 to about 29 amino acids, starting with two or three polar residues and 

10 continuing with a high content of hydrophobic amino acids; there is othenvise no 
detectable conservation of sequence known. Such signal sequences are known to those 
of skill in the art, and include the naturally occurring signal sequence derived firom a 
hedgehog protein. 

The fusion polypeptide of the invention includes a polypeptide encoded by a structural 
15 gene, preferably at the amino-terminus of the fusion polypeptide. Any structural gene is 
expressed in coigunction with the C-polypeptide (polynucleotide) and optionally a carrier 
peptide. The structural gene is operably linked with the carrier in an expression vector 
so that the fusion polypeptide is expressed as a single unit. 

The identification of the autoproteolysis of hedgehog into the N and C domains is useful 
20 in a screening method to identify compounds or compositions which affect this 
processing activity. Thus, in another embodiment, the invention provides a method for 
identifying a composition which affects hh processing, which can be determined by 
activity or gene expression, comprising incubating the components, which inchide the 
composition to be tested {e.g., a drug, a small molecule, a protein) and a hh polypeptide 
25 or a recombinant cell expressing hedgehog or a gene encoding a C domain or functional 
fragment thereof operably linked to an N domain or fiinctional fi-agment thereof, under 
conditions sufficient to allow the components to interact, then subsequently measuring 
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the effect the composition has on hedgehog activity or expression. Fragments of 
hedgehog polypeptide or polynucleotide can be used in the method of the invention as 
long as autoproteolytic activity remains (e.g., the constnict exemplified in Figure 12a and 
12b, Example 10), The obsen^ effect on hh may be either mhibitory or stimulatory. For 

5 example, one can determine whether the N domain is associated with the cell, or whether 
the N domain is secreted into the medium, in other words, v*cther incomplete processing 
has occulted. Such methods for determining the effect of the compound or composition 
on hh processing include those described herein (see Example 10, Figure 12a and 12b) 
such as time course of autoproteolytic cleavage or course of cleavage based on 

10 concentration ranges. Alternatively, the effect of the composition on hh can be 
detennined by the expression of anterior or posterior neural markers. Other methods for 
detennining the effect of a composition on processing of N and C will be known to those 
of skill in the art. Various labels can be used to delect the N and C domains, for example, 
a radioisotope, a fluorescent compound, a biohiminescent compound, a chemiluminescent 

IS compound, a metal chelator or an enzyme could be used. Those of ordinary skill in the 
art will know of other suitable labels or will be able to ascertain such, using routine 
experimentation. 

As used herein, "hh activity** as described in the screening method refers preferably to 
autoproteolytic activity. However, it is understood, that one of skill in the art could use 
20 the above-described screening assay to identify a composition having an affect on other 
hh activities, for example, zinc hydrolase activity. Appropriate assays for determining the 
effect on such activities will be known to those of skill in the art. 



25 



The following examples are intended to illustrate but not limit the invention. While they 
are typical of those that might be used, other procedures known to those skilled in the art 
may alternatively be used. 
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HFDCEHOG PROTEIN PROCESSING 

The fiill length fonn of the hh protein (F) migrates with a mobility corresponding to a 
relative molecular mass of 46 kD. FIGURES 1 (A) and (C) are immunoblots with 

5 antibodies against amino* (Abl) and carboxy-terminal (Ab2) epitopes. GST fusion 
protdns containing either residues 83 to 160 or 300 to 391 from HH protein were 
expressed in Escherichia coli, purified as recommended [F. M. Ausubel, et aL, Current 
Prouxols in Molecular Biology (Greene and Wiley-lnterscience, New York, 1991)], and 
used to immunize rabbits by standard methods. The antibodies were affinity purified on 

10 a column of His6-U protrin [E. Harlow and D. Lane, Antibodies: A Laboratory Manual 
(Cold Spring Harbor Laboratory. Cold Spring Hart>or, NY, 1988)] linked to Aifi-Gel 10 
beads (Bio-Rad). The purification was perfonned as described (Harlow and Lane, supra) 
except that the acid and base elutions contained 10 percent dioxane. Biotinylated hh 
antibodies were prepared by purifying the rabbit antisera over a protein A colunm, 

15 followed by biotinylation with the use of the Immunoprobe biotinylation kit (Signui). 
Immunoprecipitations were performed as described [Harlow and Lane] with the use of 
cold RIPA lysis buffer containing 0.25 mM phenylmeth^sulfonyl fluoride (PMSF) and 
5 mM EDTA for tissue homogenizatibn. 'Lysates were precleared twice with pre-immune 
rabbit serum plus protein A beads (Gibco-BRL). AflBnity-purified antibodies or 

20 preimmune serum was then added, and the immunoprecipitation was performed with 
protein A beads, with the use of NP-40 lysis buffer for the washes. 

Immunoblots were performed with affinity purified Abl or Ab2 by either of two 
chemiluminesccnce based protocols. In the first protocol (used in Figures 1, 3, and 5) 
samples were resolved on 15 percent or 12 percent SDS-polyacrylamide gels (F. M. 
25 Ausubel et al., si^ra) and transferred to Magnagraph nylon membranes (MSI) by 
electroblotting. Blots were developed with the use of an alkaline phosphatase conjugated 
donkey anti-rabbit IgG secondary antibody and Lumi-Phos S30 (Boehringer Mannheim) 
under recommended conditions. In the second protocol (used in FIGURE 8), samples 
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were transferred to nitroceUulose filters (Schleicher and Schuell), and blots were 
developed using ECL reagents (Amersham) as reconunended. The secondary antibody 
in this case was horseradish peroxidase conjugated goat anti-rabbit IgG (Jackson 
InununoResearch). Lanes contain protdn from induced untransfected S2 cells (lanes 1 
5 and 13), transfected S2 cdls induced to express hh (lanes 2 and 14), imaginal discs Oanes 
3 and IS), wild type embryos (lanes 6 and 18), and in vitro translations of synthetic h 
mRNA both in the presence (lanes S and 1 7) and absence of microsomes (lanes 4 and 16). 

cDNAs encoding various hh protein species were cloned into the pMK33 vector, which 
allows for inducible expression under metallothionein promoter control (M. R. Koelle et 

10 al.. Cell 67:59,1991). Stable S2 cell lines were made by transfection of the hh/pMK33 
plasmids with constant selection for hygromydn resistance. Proteins were expressed by 
plating a log phase culture of cells diluted to 0.1 A», units, waiting 48 hours, inducing 
with CUSO4 at 0.2 mM final concentration, and harvesting the cells and/or supernatant 24 
hours later. Cell samples for immunoblotting were made by adding 10 volumes of IX 

15 SDS PAGE loading buffer to pelleted cells. 

In vitro translations were performed with the use of the TNT coupled transcrip- 
tion-translation system (Promega). "S methionine (DuPont NEN) was used for detection 
by autoradiography. In the heparin binding experiment in vitro translation lysate with 
microsomes that produce wild-type hh protein was added to heparin agarose (Sigma) or 

20 Sepharose CL-4B (Pharmacia) beads pre-equilibrated with heparin binding buffer (HBB; 
20 mM Tris (7.4), 150 mM NaCl, 0. 1 percent Triton X-100). Samples were incubated at 
4** C for four hours with gentle rocking. After pelleting the beads, supematants in some 
samples were analyzed (lanes 2 and 4). The beads were then washed 5 times with chilled 
HBB and samples (lanes 3 and 5) were subsequently eluted at 80** C for 10 minutes in 

25 SDS PAGE loading buffer (F. M. Ausubel et al., supra). 

Embryos fi^om the wild-type Canton-S line and from the matings, hshhfhshh or hshh 
H329A/hsWi H329A Xy, Sco/CyO, enlacZJI::wg (Kassis, et aL. Proc, Natl Acad. Sci. 
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U.S.A. 89: 1919, 1992), were coUected 0 to 16 hours after egg laying (AEL) at 25' C. 
They were heat shocked for 30 miiaites at 37" C and aUowed to recover for 1 hour at 25° 
C. Embryos in HGURE 1 (Canton-S) were collected 4 to 8 hours AEL at 25" C. In 
preparation for immunoblotting. all embryos were dechorionated in 2.6 percent sodium 
hypochlorite and homogenized in 10 vohmies of IX SDS PAGE loading buffer. 

Multiple species were detected and minor ctoss reactive bands are seen in most samples 
including extracts of induced untransfected S2 ceUs (lanes 1 and 13). One of these bands 
(occurring in both panels) co-migrates with U (at 39 kD) and is particularly abundant in 
lane 6 of FIGURE 1 (A). 

FIGURES 1 (B) and (D) are blots of samples immunoprecipitated with Abl (B, lanes 
7-9), Ab2 (D, lanes 19-21), or pre-immune serum (B, lanes 10-12 and D, lanes 22-24). 
Detection was with Wotinyteted derivatives of Abl (B) and Ab2 (D). Samples used were: 
induced untransfected S2 ceUs. lanes 7, 10, 19 and 22; transfected S2 cells induced to 
express hh, lanes 8, 11, 20 and 23; and embryos, tanes 9, 12. 21 and 24. For either 
antibody, hh protein ftagraents were specifically immunoprecipitated from hh expressing 
cells and embryos, but not from untransfected ceUs. (E) In the schematic diagram, 
cleavage sites are denoted by arrows. The cleavage site mariced by the asterisk is inferred 
by identification of only one cleavage product and may therefore occur at another 
location within the C fragment. The first two columns to the right of the diagram indicate 
the reactivity of Abl and Ab2 to each hh fragmem. The other columns indicate the 
presence (+) or absence (-) of each hh fragment in the various samples. Parentheses 
around F and N» indicate that these species are detected in in vitro translation reactions 
but not in vivo. 

The 46kD species was deteaed from in vitro transhttion extracts by Abl and Ab2 
(FIGURE 1, lanes 4 and 16), and was partially converted to a species of 39 kD (U) when 
translation occurred in the presence of microsomes (FIGURE 1, lanes 5 and 17). A 39 
kD species co-migrating with U is also presert in extracts from all in vivo sources, but 
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none of these extracts contain detectable levels of F. U represents the signal-cleaved 
form of F; signal cleavage thus appears to be relatively inefficient in vitro, as reported 
previously, (J. J. Lee, etal.. Cell, 2L'33, 1992), but is highly efficient in vivo. To confirm 
that signal cleavage indeed is occurring at this unusual internal location, a mutation that 

5 changes residue Sm to N at the predicted signal cleavage site was introduced. This 
mutation prevented conv^on by microsomes of F to U and also produced a species that 
comigrated with F upon transfection into cultured S2 cells. The effects of independently 
mutating the two methionine codons present upstream of the signal sequence were also 
examined. In vitro translation of the sequence in which the first methionine is removed 

10 produces a protein species intermediate in mobility between F and U, and this species is 
converted to a species that comigrates with U in the presence of microsomes or when 
produced in vivo. Alteration of the second methionine codon caused no change in the 
electrophoretic mobility of Hh protein produced in vivo or in vitro. 

Smaller species of Hh proteins fi-om in vivo sources have been reported previously (T. 
15 Tabata and T. B. Romberg, Cell 7g: 89, 1994). The latter study examined not 
endogenous proteins, but proteins induced to express at high levels firom exogenously 
introduced constructs. The antibody used did not distinguish epitopes fi-om distinct 
portions of the molecule. 

In addition to signal cleavage, a fiirther cleavage of the U precursor is responsible for 
20 generating other forms of hh protein observed in vivo This was deduced fi-om the 
observation that Abl and Ab2 both detected the U (uncleaved) species, but also interaaed 
individually with smaller protein species expressed endogenously in embryos and 
imaginal discs or with species expressed upon introduction of the hh gene into S2 cells. 
Abl thus interacts with a 19kD species fi-om all of these tissues (FIGURE 1, lanes 2, 3, 
25 6, 8, 9), while Ab2 interacts with a 2S kD species and a 1 6 kD species (FIGURE 1, lanes 
14, IS, 18, 20, 21). The 19 kD species hereafter is referred to as N (N-terminal fi-agment), 
the 2S kD species as C (C-temunal fi-agment) and the 16 kD species as C*; these species 
represent the major forms of endogenous hh protein present in vivo. 
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The proposed cleavages by whidi these species arise are shoiwn schematically in the 
bottom portion of FIGURE 1 . The N and C species are uniquely detected by Abl and 
Ab2, respectively, and the sum of the rdative masses of the two smaller species is 
roughly equivalent to the relative mass of U. The dectrophoretic mobilities of the F and 
U species are somewhat at variance with their predicted relative masses (52.1 kD and 
43.3 kD, respectively). The identities of these spedes were confirmed by in vitro 
translation of a variety of hedgehog open reading frames modified to contain different 
extents of sequence at the NH,- or COOH- terminus, and by insertion of epitope tags 
The migration anomalies appear to be assodated with protein species in which sequences 
from both the NHj- and COOH-terminal fragments are simultaneously present The 
mobilities of the NH,- and COOH-terminal fragments, in contrast, correspond to rdative 
masses (19 kD and 25 kD, respectivdy) that sum to yield 44 kD, roughly equivalent to 
the expected rdative mass of U. 

A simple mechanism that could account for the derivation of the two smaller spedes 
therefore would be a single internal deavage of the U precursor. Processing of the hh 
protein when translated in vitro also yidds a 25 kD spedes (C; lanes 16 and 17) and 
either a 29 kD or 19 kD (N) spedes (lanes 4 and 5). The 1 9 kD spedes comigrates with 
N, and its formation depends upon the presence of microsomes, consistent with the 
proposal that N derives from F by signal cleavage and a further internal cleavage. The 
overall pathway for formation of the predominant forms of hh protdn observed in vivo 
thus appears to invoh^e signal deavage of F to generate U. U is tiien deaved internally 
to form N and C, which are the predominant forms found in vivo. Further processing of 
the 25 kD C species might tiien generate the 16 kD C* species, but whether this 
processii^ is a angle cleavage event or not is not clear since Ab2 does not recognize the 
smaller 9 kD fi^gmem that would result. The processing of C to generate C* appears to 
occur witii greater efficiency in imaginal discs as compared to embryos (compare lanes 
15 and 18); this may be caused by tiie more extended mass isolation procedure of 
imaginal discs (O. M. Eugene, era/.. Tissue Culture Assn. Man.. S: 1055, 1979). 
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The coinigration of endogenous and in v/lra-generated hh protein species suggested that 
in vitro processing is sndlar to that observed in vivo. FIGURE 2 shows limited sequence 

5 similarity between hh proteins and serine proteinases, hh protein sequences are aligned 
to residues 323 to 329 of the D, melanogaster protein and numbered as positions 1 to 7 
(group A). Conserved hh residues are in bold letters. The catalytic histidines (A. J. 
Barrett, in Proteinase inhibitors A. J. Barrett, G. Salvesen, Eds. (Elsevier, Amsterdam. 
1986) pp. 3-22) of mammalian serine proteinases (group B) are aligned to the invariant 

10 histidine at position linHh proteins. Abbreviations are as follows: C-Shh, chicken Sonic 
M (R. D. Riddle, el a/., Ce//25: 1401, 1993); M-S«i, mouse Sonic hh (Y. Echelard et ai., 
Ce//7S: 1417, 1993) (identical to//Ag-l; R vA/r-1, rat vhh-l (H. Roelink et al., CellTj^: 
761, 1994); Z-S/i/i. zebrafish Sonic hh (S. Krauss, et al.. Cell 75: 1431. 1993) (identical 
to shh) and zebrafish v/r/r-1, (H. Roelink et al., supra)\ twhh, no other abbreviation; 

15 M-DMr, ntavLS& Desert Ml (Y. Echdard et al., CellJ^: 1417, 1993); M-IA/i, mouse Indian 
hh (Y. Echelard et al.. supra); CHT, bovine chymotrypsin; TRP, bovine trypsin; ELA, 
porcine elastase; UKH, human urokinase; CIR, human complement factor IR; CIS, 
human complement factor IS; MCP, rat mast cell protease; FAX, hurmn blood clotting 
factor X; TP A, human tissue plasminogen activator. 

20 Figure 2 shows that a seven residue region of hh coding sequence (residues 323 to 329 
in the Drosophila protein) displays some similarity to the sequences of serine proteases. 
This region lies approximately two thirds of the distance from the signal cleavage site to 
the cart>oxy-terminus, and includes Thr and His, residues (positions 4 and 7 in FIGURE 
2) that are invariant among all hh sequences from all species. In the serine proteases, this 

25 cons^ed sequence contains an invariant His that acts as a general base in catalysis (A. 
J. Barrett, 'm Proteinase inhibitors A. J. Barrett, G. Salvesen, Eds. Elsevier, Amsterdam, 
1986, pp. 3-22). 
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To determine whether this invariant His residue in the Wi protein indeed plays a role in 
auto-proteolysis, two proteins from iS: coli were purified: one carried the wild type 
sequence and the other a substitution of an Ala codon for the His codon at position 329 
(H329A). Both of these proteins were engineered to contain a hexa-Wstidine tag at the 
5 amino terminus fused to Drosophila sequences extending from a residue just before the 
signal cleavage site to the carboxy-terminus (residues 83 to 471; the wUd type form of 
this protein is referred to as HiSfU) Both proteins were extensively purified under 
denaturing conditions using a Ki** -chelating matrix. FIGURE 3(A) is a coomasie blue 
stained polyacrylamide gel that shows the production and purification of His.-U and 
10 His.-U„». proteins from K coli. Samples were molecular weight markers (lanes 1 and 
2); lysates of £ coli cells carrying the His,-U expression construct without (lane 3) and 
with (lane 4) induction by IPTG; purified Hia,-U protein (lane 5); lysates ofE. coli cells 
that cany the Hi8rU„». expression construct without (lane 6) and with (lane 7) induction 
by IPTG; purified HIstUhiw* protein (lane 8). Purified proteins were essentially 
IS homogeneous except for several minor species of lower relative mass; these species are 
endogenous breakdown products of the fidl-length proteins since they were absent in 
uninduced extracts and were detectable with hh antibodies. FIGURE 3 (B) is an 
immunoblot detected with Ab2 showing transfccted S2 ceUs induced to express hh Oane 
1); HvU and HisrU«», proteins incubated in cleavage reaction buffer for 0 hours Oanes 
20 2 and 5), for 20 hours (lanes 3 and 6), and for 20 hours in the presence of 20 mM TAME 
(a serine protease inhibitor) (lanes 4 and 7). Upon incubation the HivU, but not the 
HiS6-UH,«A protein, released a fragment presumed to be C on the basis of reactivity with 
Ab2 and co-migration with C produced in S2 cells. Release of C (lane 3) was only 
partially inhibited by TAME. 

25 Preliminary proteinase inhibitor studies have been performed on in vitro translated Hh 
protein by adding various inhibitors at the start of the translation reaction. These studies 
have been complicated by the fiict that numerous protease inhibitors lower or block 
translation efficiency. In some cases the effectiveness of an inhibitor was assayed by 
determining if addition of an inhibitor to a completed translation reaction wiU inhibit the 
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self-processing that normally continues to occur. At this time we can only state the 
following with certainty: (i) the serine protease inhibitor TAME 
(p-tohioiesulfonyl-L-arginine methyl ester) inhibits auto-proteolysis of in-vitro translated 
Hh protein; (ii) soybean trypsin inhibitor, a, anti-trypsin, aprotitin, leupeptin, and E-64 
5 do not block auto-proteolysis of translated Hh protein; and (iii) TAME partially inhibits 
auto-proteolysis of purified Hise-U protein (FIGURE 3, panel B). 

As seen in FIGURE 3B, upon dilution of denaturant the wild type protein but not the 
H329A mutant protein released a 25 kD species detectable by Ab2 and identical in 
mobility with the C species produced from in vitro translations and various in vivo 

10 sources. This cleavage was also observed when the wild type protein was purified and 
renatured by other protocols and cleaved under distinct conditions. Plasmids encoding 
the Ifis«-U and His6-Uic29A proteins were generated by inserting sequences corresponding 
to readues 83 to 471 firom the wild-type or hh H329A ORF into the pRSETB «pression 
vector (Invitrogen). Proteins were induced in BL21 (DE3)/pLysS E. coli cells as described 

15 (F. M. Ausubel et al., supra). The basic purification was performed on Ni-NTA agarose 
beads (Qiagen) by a denaturing protocol vath the use of 6 M guanidinium HCl and 8 M 
urea essentially as recommended (a detailed protocol of exact conditions used is available 
upon request). Washes contained 0.2 percent Tween 20 and S mM b-mercaptoethanol. 
The final wash buffer was: 6 M urjsa, 100 mM Tris, SOO mM NaCl, 20 percent glycerol, 

20 (pH 7.4). Ehitions were with the final wash buffer containing 250 mM imidazole In vitro 
cleavage reactions were performed by incubating the purified protein (diluted 1 :30 in the 
final mix) in cleavage buffer [50 mM Tris, 500 mM NaCl, 5 percent glycerol, 0.2% Triton 
X-100, 50 mM DTT, (pH 7.4)]. To isolate soluble fijil-length His^-U protein fi-ee fi-om 
denaturants or detergents, additional steps were taken (this refers to the other renaturation 

25 protocols mentioned in the text). Full-length protein fi^om the eluate described above was 
fiirther purified fi'om breakdown products by precipitation, by urea removal through 
dialysis. The precipitate was then re-solubilized in a buffer containing guanidinium HCl 
and loaded onto another >H-MrrA agarose cohjmn. After washing as described, the protein 
was re-folded (w^e attached to the beads) by gradual dilution of urea (fi^om 6M to O.SM) 
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with dilution buffer [(100 mM Tris. 500 mM NaCI, 20 percent glycerol, (pH 7.4)] over 
an 8 hour period at 4* C. The protein was dated with dilution buffer containing 250 mM 
imidazole and 0.5M urea. The eluate was dialyzed in 100 mM Tris, 150 mM NaCl, 10 
percent glycerol, (pH 7.4) at 4° C and stored at -70' C. 



5 EXAMPLE 3 

MAPPING THE A irrO.PROTEOI YTTC FUNCTIONS OF hh 

To more precisely define the domain of the hh protein responsible for this auto- 
proteolytic event, the effects of several distinrt types of mutations upon in vitro 
prooessiqg were examined. The most informative mutation was a deletion that removes 

10 residues 89 to 254 (A89-254), which togetiier constitute most of tiie amino acids within 
the portion of the molecule presumed to form the N firagment. In vitro translations of 
wild-type and mutant Hh proteins from Drosophila (FIGURES 4 A-C) and zebrafish 
(HGURE 4D) arc shown. The locations of mutations and cleavage sites (arrows) in these 
proteins are schematicaUy ilhistrated (nOURE 4E). In tiie Drosophila protein (HGURES 

15 4A, B, and C), auto-proteolysis is blocked or severely inhibited by several mutations in 
the COOH-temunus (H329A, 294 tninc. 410 trunc, flu408 and 456 trunc), but is 
unaffected by a large deletion (A89-254) or insertion of a flu-tag epitope trimer (flu227) 
in the NHi-terminus. Auto-proteolysis thus depends primarily on residues witiiin tiie C 
fragment (sequences to the right of the cleavage site in the diagram below; see HGURE 

20 1). Furthermore, the H329A/flu227 double mutant is not cleaved by wild-type protein in 
a mixing experiment Oane 11). suggesting an intramolecular mechanism for auto- 
proteolysis. Hh proteins encoded by the zebrafish genes twhh and shh display a pattern 
of processing (D) similar to tiutt of tiie Drosophila protein although tiie NH,-terminal 
fragment of each zebrafish protein (23 kD for twhh and 22 kD for shh) has a lower 

25 appaiem mass tiian the COOH-terminal Augment (25 kD for twhh and shh). This is the 
resuH of a shorter stretch of residues tfiat precedes tiie signal sequences as compared to 
the Drosophila protein. Processing is blocked by H273 A and H270A mutations in twhh 
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and shh proteins respectively (analogous to the H329A mutation in the Drosophila 
proteinX ^Mch suggests an auto-proteolytic processing mechanism is used similar to that 
observed for the Drosophila protein. 

In vitro translations were performed with the use of the TNT coupled 
5 transcription-translation system (Promega). "S methionine (DuPont MEN) was used for 
detection by autoradiography. In the heparin binding experiment (FIGURE 8C), in vitro 
translation lysate with microsomes that produce wild-type Hh protein was added to 
heparin agarose (Sigma) or Sepharose CL-4B (Pharmacia) beads pre-equilibrated with 
heparin binding buffer (HBB; 20 mM Tris (7.4), 150 mM NaCl, 0.1 percent Triton 
10 X-IOO). Samples were incubated at 4^ C for four hours with gentle rocking. After 
pelleting the beads, supematants in some samples were analyzed (lanes 2 and 4). The 
beads were then washed S times with chilled HBB and samples (lanes 3 and S) were 
subsequently ehited at 80*" C for 10 minutes in SDS PAGE loading buffer (F. M. Ausubel 
et al., supra). 

15 All mutations in the hh gene were generated in the plasmid pFl (J. J. Lee, et al,, supra). 
Mutations in the zebrafish twhh and shh genes were generated with the original cDNA 
clones as described (Ekker, et al. Current Biology. 5f8V 944,1995). All point mutations 
were generated with the use of recombinant circle PCR (D. H. Jones and S. C. 
Winistorfer, Biotechniques 12: 528, 1992). The flu408 and flu227 mutations were 

20 generated by inserting a trimer of the influenza hemagglutinin antigen (42 residues for 
flu408 and 43 residues for flu227) into the AlwN I and Bgl I sites present in the hh ORF 
(nucleotide positions 1604 and 1058 respectively) (J. J. Lee, et al., supra). The A89-254 
mutation was generated by removing sequences between the EcoN I site (644) and the 
Pml I site (1 145). The 294 trunc mutation was generated by removing sequences between 

25 the Acc I site (1265) and the Xcm I site (1792). The 410 trunc mutation was previously 
generated and identified as Hh^^^ (J. J. Lee, et al, supra). To map the mutation in the hh 
allele (base change Ci7S6 to A; coding change Tyss? to .STOP), DNA isolated from 
hh^^/TM3 was used to seed PCR reactions generating regions of the hh ORF and flanking 
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sequences, which were subdoned into Bhiescript KSM (Stratagene). Six clones each, 
derived from two different PCR aiiq>lifications were sequenced. 

As seen in lanes 1 and 2 of FIGURE 4A. this construct generates a fiiU length species of 
a mobUity corresponding to the expected relative mass of 33 kD, and two cleaved 
products whose apparent relative nuisses (25 and 9 kD) sum to give the relative mass of 
the larger species. The smaller of the cleaved products wiU occasionaUy migrate as two 
bands as seen in Fig 4A. We have chosen the lower of the two bands between the 1 4.3-kD 
and 6.2-kD markers for our molecular weight measurement. The larger of the two 
cleaved products comigrates with the C species produced from the wUd type protein, 
suggesting that the A89-254 hh protein contains the residues normally present in C and 
all of the detenninants required for auto-proteolysis, including the normal cleavage site; 
most of the residues within N are dispensable for auto-proteolytic activity. 

In contrast, lesions affecting residues presumed to lie within C block auto-proteolysis in 
vitro. All mutations tested by in vitro trandation were also examined in S2 cells by 
iramunoblottmg. In all cases the pattwns of cleavage in S2 cells were idemical to those 
observed in translations except that C» was always present whenever C was formed. The 
former fragmert was not observed in translations. These include the H329A mutation 
described above, a mutation that inserts an influenza virus epitope between residues 408 
and 409 (flu408). and three mutations that cause premature termination of the protein at 
the carboxy terminus. The two most severe truncations, 294 trunc and 410 trunc, are 
mutations generated in vitro. They cause a loss of 1 77 and 61 residues, respectively, from 
the carboxyl-terminus of the protein, and neither undergoes proteolysis. The 456 trunc 
hh protein is like that encoded by the EMS-induced mutant allele, which results in 
the loss of 15 residues from the caiboxy-terrainus of the protein. This protein undergoes 
auto-proteolysis, as demonstrated by the appearance of a 24 kD band in place of C, but 
the efficiency of the reaction is much impaired in vitro (nGURE.4B). Auto-proteolysis 
of the hh protein relies mainly upon residues within C; deletion or alteration of residues 
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within this domain is associated with reduced effidency of processing, and one such 
deletion appears to be the cause of the hh^^^ mutation. 

The sequence homology and auto-proteolytic function of the full length hh protein 
suggested the possibility that F or the C fragment is a sequence-specific protease. As a 
5 first step in clarifying the mechanism of auto-proteolysis, an influenza virus epitope tag 
was introduced into the N-terminus of a /zA open reading frame that also carried a H329A 
mutation. FIGURE 4C shows that the insertion of the epitope tag alone does not interfere 
with auto-proteolysis (lane 9), and yields a normal C fi-agment and an N fragment of 
increased relative mass (compare to wild type in lane 12). The protein carrying both 

10 mutations does not undergo proteolysis (lane 10), and since the epitope-tagged N 
fragment migrates differently from N, this double mutant provides an ideal substrate to 
look for intennolecular cleavage upon mixture with a wild type sequence. Lane 1 1 shows 
that in such a mixture, although normal N is formed, no tagged N can be detected. Thus, 
in this experiment, no appreciable intermolecular cleavage occurs. We also failed to 

15 detect intennolecular cleavage in the following two experiments: (i) co-transfection of 
wild type and 410 trunc sequences into S2 cells (the cleaved 410 tame protein would 
yield a snudler and therefore identifiable form of C); (ii) mbdng of excess unlabelled, 
purified His6-U protein with labelled, in vitro translated H329A mutant protein. Thus, 
although an intennolecular mechanism for regulation of auto-proteolysis or for cleavage 

20 of other proteins can not be ruled out, the current evidence suggests that cleavage of the 
hh protein occurs predominantly by an intramolecular mechanism. 

The hh gene has been broadly conserved in evolution, with single homologues 
unidentified in a wide variety of invertebrate species and multiple distinct homologues 
in each of several vertebrate species (Y. Echelard et al.. Cell 21. 1417, 1993; S. Krauss, 
25 ei ai. Cell 11: 143 1, 1993; H. Roelink et al., Cell, supra). As seen in FIGURE 2, all of 
these coding sequences contain an invariant histidine and other conserved residues at a 
position corresponding to H329 in the Drosophila protein. In addition, the protein 
encoded by at least one of the mouse genes appears to be processed in vivo to yield two 
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smaUer species in a manner resembling the in vivo processing of the DrosopMa protein. 
To detennine whether auto-proteolysis may also play a role in vertebrates we examined 
the behavior of proteins encoded by two distinct hh homologues from the lebrafish, twWi 
and 9hh. FIGURE 4D demonstrates that when these sequences are translated in vitro, 
smaller species are generated whose reUtive masses sum to yield approximately the 
relative mass of the fiiU length protein Canes I and 3). As seen in lanes 2 and 4. this 
cleavage reaction is blocked by substitution of Ala codons for the His codons at positions 
corresponding to H329 in DrosopMla (see FIGURE 2). Vertebrate hh proteins thus 
appear to be processed by a similar mechanism as the Drosophila proton. 



10 
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EXAMPLE 4 
p^l F OF A1TTO.PT?OTFOI ,YS1S IN EMBRYOS 

Numerous fanctions for the hh gene have been described in Drosophila. At the 
morphological level these include a role m patterning of larval cuticular structures and 
aduh structures such as the eye and appendages (C. Nusslein-Volhard wd E. Wieschaus. 
Nature 2SZ: 795. 1980; and J Mohlcr, Genetics m 1061, 1988).; the mechanistic basis 
for thee morphological effects involves signaling for maintenance or induction of gene 
expression in embiyos and imaginal discs (J. J. Lee. supra; T. Tabata and T. B. Kornberg. 
Ce//76: 89, 1994; and K. Basler and G. Stnihl, Nature m. 208. 1994). To ascertain the 
importance of auto-proteolysis for these functions, the H329A mutant gene under control 
of the hsp 70 promoter was introduced by P element-mediated transformation into the 
Drosophila germUne. The hsWi H329A construct was made identically to the hshh 
constnict whh the use of a «r ORF fragment containing the H329A mutation, Transgenic 
flies were generated from ay >f""parental strain using standard methods of? element 
mediated transformation (A. C. Spradling and G. M. Rubin. Science m. 341 1982). A 
Une. HA3. carrying the Ytshh H329A ? element on the second chromosome was 
maintained as a homozygous stock. To assay for expansion of stripes, embryos 
coUected at 4 to 6 hours after egg Uiying (AEL) at 25" C were subjected to the following 
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teat shodc protocols prior to fixation. Embryos receiving single shocks (10 or 30 minutes 
at 37*" C) were allowed to recover for 1 hour at 25* C. Embryos receiving double shocks 
(two 10 minute or two 30 minute shocks at 37* C) were allowed to recover 90 minutes 
after the first shock and 40 minutes after the second (Both recoveries were at 25° C. The 
5 double 30 minute protocol was as previously described, (S. Krauss, supra). In situ 
hybridizations were pwformed as described (D. Tautz, Chromosoma QJ: 8 1 , 1 989) using 
a wg specific probe (D. T. Chang et al., supra). Embryos assayed for cuticle phenotype 
were heat shocked 6 to 8 hours AEL for 30 minutes at 37* C, allowed to develop at 25 * 
C for 36 hours and then processed and mounted as described (M. Ashbumer, Drosophila: 

10 A Laboratory Manual Cold Spring Harbor Laboratory Press. New York, 1989). 
Immunolocalizations (single or double stains) were performed as described. With the use 
of affmity purified Abl or Ab2 for the primary antibody and alkaline phosphatase (A?) 
or horseradish peroxidase (HRP) conjugated anti rabbit or mouse IgG (Jackson 
Inununoresearch) for the secondary. Embryos fi-om a /iA"^/TM3 fizAwJ. (the balancer 

15 chromosome was firom the Bloomington Stock Center, strain 321 8) stock homozygous 
for the allele were identified by the lack of staining with an anti b-galactosidase 
antibody (Promega) in a double stain with Ab2 (FIGURE 9, panel D). Staining in 
FIGURE 9, panels B and C were performed formaldehyde fixed Canton-S embryos with 
the use of an AP conjugated anti-rabbit IgG secondary. Although standard formaldehyde 

20 fixation was generally used, heat and acid-formaldehyde fixation also gave similar 
resuhs. GST fusion proteins containing either residues 83 to 160 or 300 to 391 firom the 
Hh protein were expressed in coli, purified as reconunended (F. M. Ausubel et al., 
supra), and used to immunize rabbits by standard methods. The antibodies were affinity 
purified on a column of His^-U protein (Harlow and Lane, supra) linked to Affi-Gel 10 

25 beads (Bio-Rad). The purification was performed as described (Harlow and Lane, supra) 
except that the add and base elutions contained 10 percent dioxane. Biotinylated hh 
antibodies were prepared by purifying the rabbit anttsera over a protein A column, 
followed by biotinylation with the use of the Inmiunoprobe biotinylation kit (Sigma). 
Immunoprecipitations were performed as described (Harlow and Lane, supra) with the 

30 use of cold RIPA lysis buffer containing 0.2S mM PMSF and S mM EDTA for tissue 
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homogenization. Lysates were precleared twice with pre-imraune rabbit serum plus 
protein A beads (CHbco BRL). Affinity purified antibodies or pre-immune serum was then 
added, and the immunoprecipitation was performed with protein A beads, with the use 
of NP-40 lysis bufi^ for the washes. 

FIGURE 5 (A) and (B) are immunoblots developed with the use of Abl and Ab2 
antibodies respectively. Lanes 1 and 6, induced untransfected S2 cells; lanes 2 and 7. 
transfected S2 cells induced to express hh, lanes 3 and 8. heat shocked wild-type 
embryos; Umes 4 and 9. heat shocked hsW» embryos; lanes 5 and 10. heat shocked hshh 
H329A embryos. In heat shocked hshh embryos, the wUd-type Hh protein is both induced 
and properly processed to generate the U, N. C and C* species seen in other expression 
contorts. In contrast, the H329A is induced but not appreciably processed in hshh H329A 
embryos (the low levels of processed species in lanes 5 and 10 are probably from 
endogenous hh expression since they are seen at identical levels in heat shocked 
wild-type embryos in lanes 3 and 8). 

FIGURE 5 shows that heat shock induction results in the formation of an abundant 
species that corresponds to U based on its mobility and its interaction with Abl and Ab2 
(lanes 5 and 10). In contrast, mduction of wild type hh protein using a similar contruct 
resulted in similar levels of the N and C processed products Oanes 4 and 9). with veiy 
little uncleaved U. Thus, as observed in vitro and in S2 ceUs, the H329A mutation in 
embryos appears to greatly reduce the efficiency of auto-proteolytic cleavage of the hh 
protein. 

In FIGURE 6. the embryonic distribution of wingless (wg) RNA as revealed by in situ 
hybridization is shown in FIGURE 6 (A) wild-type (homozygous/ >»/"■•). (B) hshh, and 
(C) hs«i H329A embryos that were exposed to two 10 minute heat shocks separated by 
a 90-minute recovery period. WUd-type embryos showed little change in wg expression, 
whereas the wild-type protein and, to a lesser extent, the H329A protein each induced 
ectopic wg expression (Table 1). Panels (D). (E), and (F) show the dorsal surfaces ofy 
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w"'", hshh, and hshh H329A larvae, respectively, at the level of the fourth abdominal 
segment. These larvae were shocked for 30 minutes as embryos and allowed to complete 
embryogenesis. Cuticle cell types (r, 2"", i"", and 4**) are labeled as described (J. 
Heemskerk and S. DiNardo, Cell 2^: 449, 1994). Note the expansion of 2° cell types 
5 (naked cuticle) at the expense of 3*" and some 4° types in the hshh embryo (E) under 
conditions where the phenotype ofhshh H329A embryos (F) is identical to that of control 
embryos (D). 

Perhaps the earliest known requirement for Hh protein is in maintenance of an adjacent 
stripe of wingless (wg) gene expression in each embryonic segment (A. Martinez Arias, 

10 eial,. Development \Q3: 157, 1988; and S. DiNardo, e( al, Nature 332: 604, 1988). This 
requirement is deduced from the loss of wg expression when hh function is absent; in 
addition, the ubiquitous expression of wild-type Hh protein induces expansion of the 
domain of gene expression (P. W. Ingham, Nature 560, 1993). The effects of the 
H329A mutation upon wg expansion were examined by heat shocking embryos carrying 

15 the H329A mutant construct in parallel with embryos containing the wild-type construct. 
Although the H329A mutant protein is able to induce some expansion of the wg domain, 
the efiBciency of this activity is impaired relative to that of the wild-type protein 
(FIGURE 6, B and C; Table 1). The difference in efficiency ranges nearly as high as 
threefold depending upon the heat shock regime, and these results suggest that a 

20 uto-proteolysis of the Hh protein is important for optimal activity in embryonic signaling 
to induce wg expression. 
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TABLE 1 

yf WfL*^ .nH n.» t niit ftft «Hhrvnnir inrtiirtioii ftf wF fiinmgwn 

nunutes of heat shodc 

,0 30 10/10 30/30 

hsAA 1.0* 0.3(93) 1.5 * 0.6(120) 2.9 *0.3(41) 2.8*0.4(54) 

hsW.H329A 0.7*0.5(190) 0.9*0.4(111) 1.1*0.4(145) 1.9*0.5(93) 



Expansion of wg expression beyond wild-type controls is given as average number of 
ceU diameters * standard deviation with number of embryos scored in parentheses. 
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The effects of Hh protem on the patterning of cuticular structures are most clearly visible 
on the dorsal surface of the larva, where four distinctive cell types can be identified in 
each parasegment. These cell types have been designated T, 2/ 3"*, and 4*", from 
anterior to posterior, with hh transcription occurring in precursors of the 1 cells (J. 

s Heemskerk and S. DiNardo, supra). Difierentiation of the first three cell types was shown 
to be dependent upon hh gene function, and it has been proposed that the fates of these 
cells are determined by the concentration of Hh protein, with highest concentrations 
producing the 1 ° fate, intermediate concentrations producing the 2° fate, and the lowest 
concentrations producing the 3"* fate (J, Heemskerk and S. DiNardo, supra). This 

10 proposal was supported by observations that the most anterior cell types display the 
greatest sensitivity to a reduction of hh expression, and that all of the 3 " and some of the 
4'' bristles are replaced by naked cuticle characteristic of the more anterior 2*" cell type 
when hh is expressed ubiquitously at high levels. We have reproduced suppression 3 ° and 
some 4'' fates by heat shock induction of embryos that carry our wild-type construct 

15 (FIGURE 6E), but find that the H329A mutant is unable to alter cell fates in the dorsal 
cuticle of the larva (FIGURE 6F). Auto-proteolysis, or perhaps some other function 
blocked by the H329A mutation, thus appears to be essential for the patterning influence 
of Hh protein upon the dorsal cuticle. 



20 EFFECTS OF THE H329A MUTATION 

UPON SIGNALING IN IMAGINAL DISCS 

Studies of H329A mutant protein were extended to the function to the patterning of adult 
structures and signaling within imaginal discs. In the eye imaginal disc hh function is 
required for appropriate development of pattern (J. Mohler, Genetics 120: 1061, 1988; 
25 J. J. Lee, supra, and J. Mohler and K. Vani, supra) and more recently has been shown to 
control progression of a wave of differentiation via induction of decapentaplegic {dpp) 
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gene expression in the morphogenetic fiinow of the eye (U. Heberlein. et ai. Cell 25; 
913. 1993; and C. Ma. ai. Cell 2^: 927, 1993). In leg and wing discs, ectopic 
expression of «» has also been shown to yield pattern dupUcations and defects and is 
associated with induction of ectopic expression of other signaling molecules normally 
expressed in a zone along the anterior/posterior compartment boundary (T. Tabata and 
T. B. Koraberg. Ceim 89. 1994; and K. Easier and G. Struhl. Nature m. 208. 1994). 

For studies of signaling in imaginal discs, a thermal cycler was utilized to subject larvae 
carrying heat shock-indudble hh constructs to successive rounds of heat shock and 
recovery. The effects of temperature cycling upon expression of dpp and wg in imaginal 
discs was examined by monitoring B-galactosidase expression from a reporter gene 
carrying <^ promoter sequences or from an enhancer detector P elemem inserted in the 
wg gene. In HGURE 7. X-gal staining was used to foBow expression of HGURE 7 
(A-C) or dpp HGURE 7 (D-L) in imaginal discs of Ute third-instar larvae that carry 
wg.lacZ<xdpp-lacZxt^n^ genes! Ug (A-F). wing (G-I) and eye-antemial discs (J-L) 
from control larvae (A, D. G, J), laiyae carrying the hshh transgene (B. E. H. K) and 
larvae carrying the hahh H329A transgene (C, F. I. L) are dispteyed. In all panels anterior 
is to the left. Arrows higWight the following features: an ectopic patch of dpp expression 
in the anterior compartment of wing discs in hsMr H329A larvae G); and an ectopic band 
of dpp expression in eye portion of the eye-antennal disc anterior to the morphogenetic 
fijiTOw (mariced by the other band oidpp expression more posteriorly) in hsWi larvae (K). 
Expansion into the anterior compartment of wg expression in leg discs, and dpp 
expression in leg and wing discs in hsW» larvae is similar to that described for the ectopic 
expression oihh. Morphological changes in the anterior compartment of leg (B and E) 
and wing discs (H) were also as described (K. Easier and G. Struhl. supra). In contrast, 
discs from Yahh H329A and control larvae showed very little change in wg and dpp 
expression, even under prolonged heat shock conditions and morphological changes were 
never observed. (M-0) The eye phenotypes of adult control (M). hsWi (N) and hsA/i 
H329A (O) ffies that were shocked during larval development in a manner simUar to that 
of the imaginal disc experiments above. Duplicated eye structures were observed in hsAA 
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flies, but never in hshh H329A flies. The arrow in (N) points to a thin strip of cuticle 
betweoi the two eye structures. Other deformities were also seen in hshh flies (for 
example, compare the thorax in N to M) . 

Virgin female flies fi-om the homozygous lines hsA/r (D. T. Chang et al.. Development, 
5 1 994, in press), hshh H329A, and y w"" were crossed to males from the homozygous 
BS3.0 line (bearing a P element c^p reporter construct on the 2nd chromosome, referred 
to as dpp4acZ) (R. K Blackman, etal. Development lU: 657, 1991) or the line>'; Sco/ 
CyO, enlacZJI::wg (bearing a wg reporter P element enhancer trap on a second 
chromosome balancer; called wg-lacZ) (J. A. Kassis, et ai, Proc, Natl, Acad, Sci. U.S.A. 

10 iS.. 1919, 1992). Progeny were grown at 25*'C in aerated 0.5-ml microcentrifuge tubes 
containing yeast paste until the late second instar or early third instar stage of larval 
development. The larvae were thai cycled continuously at 37** C for 30 minutes followed 
by 25 C for 90 minutes in a Perkin-Elmer thermal cycler until they reached the late third 
instar stage. They were subsequ^tly dissected and stained with X-gal as described (M. 

15 Ashbumer, supra) or allowed to grow to aduhhood for phenotypic analysis. 

As shown in FIGURE 7A, wg expression normally occurs in a ventral sector of the leg 
disc along the anterior/posterior compartment boundary while dpp is expressed in the 
dorsal portion of the disc along this boundary (FIGURE 7D). Although thermal cycling 
of larvae carrying the wild-type hh gene produced abnormal leg disc morphology and 

20 extensive ectopic expression of both target genes, as previously reported for ectopic hh 
expression (FIGURE 7B and E), the H329A construct produced little if any detectable 
difference in these patterns of expression (FIGURE 7, C and F). Ectopic hh expression 
in the wing disc also leads to morphological changes and expanded expression of {^p 
(compare FIGURE 7, G and H), but the H329A construct produced only an occasional 

25 small patch of anterior ectopic expression (FIGURE 71). 

Ubiquitous expression of wild-;type hh also leads to ectopic expression of cfytp in the 
eye-antennal disc (compare FIGURE 7, J and K). In the antennal ponion of this disc the 
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expansion of c^p expres^on resmibles that observed in leg discs. In the eye portion of 
the disc expresaon is observed at its normal location in the furrow; however, ectopic 
expression also occurs in the form of a second dorso-ventral band at a location somewhat 
anterior to the fimow, thus giving the appearance of an eye disc with two morphogenetic 

5 furrows (FIGURE 7K). Indeed, in adults derived from temperature-cycled larvae that 
carry the wild-type hh construct, an apparently duplicated eye structure such as that in 
FIGURE 7N can be observed, with two eye structures separated by a thin strip of cuticle 
(arrow). The H329A mutant protein, in contrast, did not induce expansion of e^p 
expression in either portion of the eye-antennal disc (FIGURE 7L), and does not induce 

10 eye duplications or cuticle defects in the adult (FIGURE 70). 

The experiments described thus far comprise multiple series of larvae subjected to two 
days of thermal cycling followed by inunediate dissection for analysis of imaginal 
structures or further incubation at constant temperature for analyas of aduh structures. 
Although the H329A protein appeared to have little activity in these experiments, the 

15 small patch of ectopic c^p expression induced in the wing disc (FIGURE 71, arrow) 
suggested that some residual activity remained. This suggestion was borne out in a similar 
experiment invohdng three days of cycling prior to dissection: the H329A protein clearly 
displayed some £^/7-inducing activity in this experiment, presumably as a result of the 
higher amounts of protein that accumulated during the longer cycling period. The wing 

20 in particular, but also other imaginal discs, displayed low and variable amounts of ectopic 
oq>ression. This expression in all cases was far less extensive than that observed for 
the wild-type construct examined in parallel; furthermore, morphological deformations 
of the imaginal discs, although quite common with the wild-type protein, were extremely 
rare with the H329A protein. Although its potency is greatly reduced relative to 

25 wild-type, the H329A protein retained at least some activity in early embryonic and 
inuiginal disc induction of wg and c^p expression; in contrast, even under heat shock 
conditions far more severe than those required for effects by the wild-type protein, the 
H329A mutant remained completely inert with respect to the re-spedfication of cell fates 
in the dorsal cuticle of the larva. 



96/17924 



PCT/US9S/1S463 



-58- 

EXAMPLE 6 
DIFFERENTIA T. RELEASE OF N AND C 
INTO CULTURED CELL SUPERN ATANTS 

A puzzling feature of hh function is its apparent short-range action in settings such as 
embryonic and imaginal disc signaling to wg and <^/?, and longer-range action in other 
settings, such as patterning of the dorsal larval cuticle. These observations and the 
existence of two major protein products in vivo prompted us to look for differences in the 
sohibility or difiiisibility of N and C expressed in S2 cultured cells. FIGURES 8 (A) and 
(B) are immunoblots of cell pellets (lane 1) or supematants (lane 2) from transfected S2 
cell cultures expressing Hh protein, developed with Abl (A) and Ab2 (B). Samples in 
each lane were from the same volume of resuspended total culture. Whereas N remained 
mostly associated with the cell pellet (compare lanes 1 and 2 in A), C was nearly 
quantitatively released into the supernatant (compare lanes land 2 m B). U displayed 
partitioning properties in between those of N and C (A and B). (8C) demonstrates the 
heparin binding activity of various Hh protein species generated by in vitro translations 
with microsomes. Samples were: total translation mix (lane 1); supernatant after 
mcubation with heparin agarose or agarose (control) beads (lanes 2 and 4); and material 
eluted from heparin agarose or agarose beads after washing (lanes 3 and 5). F, N,, and 
N fragments are depleted from reactions incubated with heparin agarose but not agarose 
beads (compare lanes 2 and 4 to 1), and the same species subsequently can be eluted from 
the heparin agarose but not the agarose beads (compare lanes 3 and 5 with lane 1). 
FIGURES 8, A and B indeed show that these proteins behave differently, with most of 
the N fragment remaining cell-associated and all, or nearly all, of C being released into 
the culture supernatant. 

One possible explanation for this differential behavior might be association of the N 
fragment with extracellular matrix proteins on the surfaces of the S2 cells. Accordingly, 
the relative affinity of these two proteins for heparin agarose was examined, since heparin 
binding is a common property of proteins that associate with the extracellular matrix. 
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Given the obvious difficulty in obtaining soluble N from cultured cells, in vitro, 
translation in the presence of microsomes was used to generate sohible, tobeUed N and 
C. As shown in HGURE 8C, N but not C is depleted from these translation extracts by 
treatment with heparin agarose beads, while treatment with unmodified agarose beads did 
not deplete either fragment. Furthermore, N but not C was retained upon the heparin 
agarose beads upon extenave washing with a solution that comains 0. 1% Triton X-100 
and 150 mM NaCl; in contrast, neither fiagment was retained by unmodified agarose. N, 
but not C, binds tightly to heparin, and this behavior suggests that the low concentration 
of N released into culture supematants may be the result of binding to the extracellular 
matrix. Another mechanism that might contribute to the differential release of N and C 
into culture supernatant would be the expression in S2 cells of a receptor for N but not 
for C. Our current data can not distinguish these possibilities. 
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EXAMFLE7 

DISTINCT EMBRYONIC LOCALIZATIONS OF N AND C 

The difierential release of N and C into cultured cell supematants suggested the 
possibility that these fragments might also be differentially localized in embryos. 
Previously reported hh protein localizations utilized either antibodies specific for N 
epitopes or antibodies unable to distinguish between N and C. FIGURE 9 shows the 
differential localizations of N and C in embryos by in situ localization of the hh transcript. 
FIGURE 9 (A) is shown in comparison to the distribution of N and C epitopes detected 
with Abl and Ab2 in panels (9B) and (9CX respectively. Note that the distribution of N 
and C epitopes span approximately one-third and one-half of each segmental unit 
respectively, while the transcript is limited to approximately one-quarter of each unit. In 
(9DX the localization of C epitopes in embryos homozygous for the allele is detected 
with the use of Ab2. C epitopes in this mutant, which displays impaired auto-proteolytic 
activity are more restricted, and resemble the wild-type localization of N. Homozygous 
Mf"^ embryos were identified by loss of a marked balancer from a heterozygous parent 
stock. All embryos are at mid to late stage 9 (extended germ-band). 

FIGURE 9B shows in accordance with these reports, Abl, which is specific for N 
epitopes, reveals a segmentally localized distribution that is slightly broader than that of 
the hh transcript at the same stage (FIGURE 9A). Also consistent with these reports, we 
observed that N epitopes at later stages accumulate in large punctate structures. Our 
analysis here concentrates on the earlier stage, when antibody staining is weaker but 
before formation of the invaginations and grooves that later crease the epidermis and 
thereby complicate the interpretation. Ab2 was also utilized to detect C-specific epitopes 
with a variety of fixation and staining procedures. Although detection of C epitopes above 
background is more difficult than for N, we consistently observed a segmentally 
modulated pattern, albeit with a broader distribution than N (FIGURE 9C). This 
localization is also distinctive in that C epitopes at early or late stages are not found in the 
punctate structures characteristic of N. 
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The mutation encodes a prematurely truncated protein that is missing 15 residues 
normally present at the COOH-tenminus. Because this protein displays a much reduced 
efficiency in auto-proteolysis the distribution of C in this mutant background was 
examined. FIGURE 9D shows that C epitopes in a homozygous Afc*" embryo Odentified 
5 by absence of a marked balancer) are distnbuted in a much tighter segmental pattern than 
in wild-type. This localization resembles that of N, and we thus conclude that the broad 
distribution of C epitopes normally seen is altered in by retention of the uncleaved 
precursor near the site of synthesis. 

EXAMPLES 

,0 TRF OP Alixn-PROTEOLYSIS fN BIOGENESIS 

OF AmVE HEDG KHOC PROTEIN 

In addition to signal cleavage, the hh protein undergoes auto-proteolysis at an internal site 
to generate the predominant protein species observed in vivo. All or most of the amino 
acid readues required for this auto-proteolysis function map to C, the carboxy-terminal 

IS product of this imemal cleavage. In an effort to determine the importance of 
auto-proteolysis for function, we introduced a single residue mutation (H329A) that 
blocks auto-proteolysis of the hh protein in vitro and demonstrated that both processing 
and function of this protein is impaired in vivo. Since similar levels of induced protein 
were detected from a strain carrying the wild-type construct or fix>m several strains 

20 carrying independent insertions of the mutant construct (FIGURE 5), the impaired 
function of the H329A protein relative to wild-type is not the result of reduced levels of 
expression. Further evidence in support of a role for auto-proteolysis derives fi-om the 
efiect of the mutation, which reduces but does not eliminate auto-proteolysis of the 
hh protein in vitro (FIGURE 4). Correspondingly, the hh"^ mutation is associated with 

25 a phenotype of intermediate strength in vivo (J. Mohler, supra). 

Curiously, the H329A Hh protein appears to retain weak activity in embryonic signaling 
to induce ectopic wg e}q>res«on and, to a lesser degree, can function in imaginal disc 
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signaling for induction of ectopic and c^p expression. In contrast to its retention of 
at least some signaling functions in embryonic and imaginal tissues, the H329 protein is 
completely inert vAi&i assayed for the ability to reprogram ceO fates in the dorsal cuticle 
of the larva. 

5 The assays in which the H329A protein is active or partially active involve short-range 
signaling that normally occurs across one or at most several cell diameters; in contrast, 
the H329A protein fails to exert any effect upon patterning of the dorsal cuticle, a 
long-range activity that normally operates across most of the segment. Previous 
proposals to account for long-range patterning activities have suggested that hh 

10 expression induces other signaling molecules which are then responsible for executing 
the patterning functions (the signal relay model; see FIGURE 1 OA). FIGURE 1 0 shows 
a signal relay versus dual function models for hh protdn action. In FIGURE 10 (A), the 
long-range effects of hh signaling are achieved indirectly through short-range induction 
of a second signaling molecule PQ. Based on its biochemica] properties and its restricted 

15 tissue localization, N is presumed to represent the active short-range signal while the role 
of C would be limited to supplying the catalytic machinery required for biogenesis of N. 
In (lOB), the long- and short-range signaling functions of hh are supplied by the N and 
C proteins derived by internal auto-proteolysis of the U precursor. N is implicated in 
short-range signaling by retention near its cellular site of synthesis, while C is less 

20 restricted in its distribution and would execute long-range signaling functions. In both 
models, auto-proteolysis is required to generate fully active signaling proteins. See text 
for further discussion. 

These proposals seek to maintain a consistent mode of hedgehog action by rationalizing 
the apparent long-range activities of hh products as indirect consequences of short-range 
25 signaling. Based on the distribution observed, the active molecule in this model might 
be N and the role of C would then be limited to supplying the catalytic machinery 
required for biogenesis of N. 
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Our evidence suggests an alternative model, the dual fimction model (HGURE lOB). in 
which long- and short-range activities of the hh protein might be executed by N and C. 
the two predominant forms of the molecule observed in vivo. The marly quantitative 
release of C fragment into the culture medium of Wi-expressing S2 ceUs and its broad, 
though segmentally modulated distribution within embryos suggests that C might execute 
or contribute to long-range signaling fimctions. The N fragment, on the other hand, 
predominantiy remains associated with the expressing S2 cells and also binds to heparin, 
which suggests a possible association with the extracellular matrix. These properties and 
the segmemally restricted embryonic distribution of N are suggestive of a role in the 
execution of short-range hh signaling activities. Since the vertebrate Hh proteins we 
tested also appear to be auto-piocessed and also cany predicted heparin binding sites just 
caiboxy-tenninal to their signal sequences (H. Roelink et al., supra), many aspects of the 
dual function model discussed here in the context of Drosophila development may also 
j^ply to hh protein function in vertebrate development. 

Execution of short-range functions by N would be consistent with the observation that the 
H329A mutant protein has at least partial function in signaling for the induction of 
and 4}p, since this mutation does not alter residues located in the amino-tenninal portion 
of the protein that normally would give rise to N. The uncleaved H329A protein thus 
would carry all the residues that normally mteract with a p«5sumed receptor for N. 
although there might be some eflfect on the affinity of the interaction due to the presence 
of caiboxy-terminal sequences, thus accounting for the decreased potency of the H329A 
protein. Alternatively, the partial fimction of H329A protein may derive from an 
extremely small fraction of protein that appears to be cleaved, a very faim band with 
idemical mobility to C appears in in vitro translations with the H329A protein (FIGURE 
4. lane 3). Execution of long-range fimctions by C is also consistent with our 
observations because long-range signaling might require the release of the C fragment or 
otherwise require the H329 residue for some fimction other than for cleavage. 
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When N is synthesized from a native construct (wild type hh), it remains primarily cell- 
associated (FIGURE IOC), however, N generated from a truncated construct in cultured 
cells predominantly entos the culture medium (FIGURE lOD) (For constructs, see Porter, 
ei aL, Nature, 224^363, 199S). These results further confirm that autoprocessing by 
5 fragment C may regulate the degree of N association with the cell surface and therefore 
its range of action. 

EXAMP>.E9 

ISpy^nOW Qf HEPGEHPC HQMOMKfUlES 

The mouse and human hhAike sequences were isolated by polymerase chain reaction 
10 (PCR) using primers degenerate for all possible coding combinations of the sequences 
underlined in FIGURE 1 of Chang, eial., (Development, 12Q: 1994). PCR amplifications 
contained from 100 ng to 2 fig genomic DNA (depending upon the genome size of the 
species), 2 (iM of each primer, 200 pM dNTPs (Pharmacia), IX reaction buffer 
(Boehringer-Mannheim) and 2.S units Taq polymerase (Boehringer-Mannheim) in SO )il 
15 reactions. Amplification was as follows: 94 ''C 5 min, addition of Taq polymerase at 
75^C, followed by 94^C 1 min, 52-C 1.5 min and 72*C 1 min for 30 cycles and a final 
extenaon of 72''C for S min. AD PCR products were cloned into pBluescript (Stratagene) 
prior to sequence determination. 

Mouse clones obtained in this maimer contained 144 bases of sequence between the 
20 primer ends and were labelled with [a'^¥]dATP and used for high stringency screens of 
mouse cDNA libraries made from whole 8.5 dpc embryonic RNA and from 14.5 dpc 
embryonic brain in the AZAP vector (a gift from A. Lanahan). Several clones 
corresponding to///rg-l were isolated and the largest, 2629 bp in length (pDTCS O), was 
chosen for sequence analysis using dideoxy chain termination (Sanger, et ai, 1 977) and 
25 Sequenase v2.0 (US Biochemicals). Compressions were resolved by using 7-deaza- 
guanosine (US Biochemicals). Sequmce analysis made use of the Geneworks 2.0 
(InteiliGenetics) and MacVector 3.5 (IBI) software packages. 
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One of the three mouse dones, fl»g-l. when used as a probe, yielded a 2.0 kb clone from 
a 8.5 dpc mouse embryonic cDNA Ubraiy and a 2.7 kb done from a 14.5 dpc embryonic 
dJNA Bbiaiy. The 2.7 kb cDNA appears to represent a nearly full length mRNA because 
it corresponds to a 2.7 kb band detected by hybridization on a Northern blot. The largest 

9 methiorano-initiated open reading frame within this d)NA encompasses 437 codons, and 
is preceded by one in fiwne upstream stop codon. Sequence comparisons indicate that 
the protein encoded by Hhg-\ is identical to the independently characterized mouse Shh 
(Echdard, et al.. Cell, 75: 1417-1430, 1993) except for an arginine to lysine difference at 
residue 122. Hhg-\ also corresponds closely to the rat vhh-\ gene (97% amino add 

10 identity; Roelink, et al.. Cell. 2fi:761-775, 1994), the chicken Sonic hedgehog (81% 
identity; Riddle, et al.. Cell. 75:1401-1416, 1993) and Shh from the zd>rafish (68% 
identity; Krauss. et al.. Cell, 21:1431-1444, 1993; Roelink, et al.. Cell. 26:761-775. 
1994). The PCR-generated fragments Hhg-2 and Hhg-3 appear to correspond to the 
Indian and Desert classes of mouse hedgehog genes, respectively (Echdard, et al.. Cell. 

15 25:1417-1430.1993). 

Alignment of the Hhg-l open reading frame with the two Drosophila hh sequences 
showed that all three proteins contain hydrophobic amino add sequences near thdr 
amino-termini; the hydrophobic stretches within the D. melanogaster protdn (residues 
64 to 83) and within the mouse protein are known to act efiidently as signal sequences 
20 for deavage (Lee. et al. Cell, 21:33-50, 1992). Both Drosophila signal sequences are 
unusual in their internal locations, whUe the hydrophobic stretch of the mouse gene 
occurs at the extreme amino-terminus, a more conventional location for cleaved signal 
sequences. Ahhough portions of sequence N-terminal to the Drosophila signal sequences 
are conserved, suggesting a flmctional role, the mouse gene lacks this region. 

25 The overall levd of amino add identity between Hhg- 1 and hh carboxy-terminal to the 
»gnal sequences is 46%. A closer examination shows that the amino terminal portion, 
from residues 25 to 187, displays 69%,identity, while remaining residues in the carboxy- 
terminal portion disptoy a much lower 31% identity. Like Wi, the ///ig-1 coding sequence 
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is divided into three exons, and the boundaries of these exons are at the same positions 
within coding sequence as those of the three Drasophila hh exons. Curiously, the 
boundary between coding sequences of the second and third exons occurs near the 
transition from high to low levels of overall sequence conservation. The coincidence of 
5 these two boundaries suggests a possible demarcation of functional domains within these 
proteins. This location within Hhg-\ coding sequence also coincides approximately with 
the site of a presumed proteolytic cleavage. 

HUMAN CLON ING OF hh GENES 

10 Partial sequence for two human hh genes has been obtained by DNA sequencing of 
clones derived by PGR amplification from genomic DNA with A/i-specific degenerate 
primers as outlined in Chang, et aL, (Development, 12Q:3339, 1994) and EXAMPLE 9 
(FIGURE 1 1 A and B). More extensive screening by the same approach, either with the 
same primers or with other primers from the hh coding region or with the human hh 

15 fragments seen in FIGURES 1 1 A and B, is expected to yield at the least a third gene, and 
possibly more, since at least three genes are found in the mouse. These segments of 
human hh genes can be used to obtain full coding sequences for human proteins by the 
following cloning method commonly used by those of skill in the art and which are 
extensively described in the literature. 

20 For example, ready-made cDNA libraries or RNAs from a variety of human sources, 
including various fetal stages and organs (from abortuses) and specific infant or adult 
organs (from pathological or autopsy specimens), are being tested for the presence of hh 
sequences by PGR or RT-PCR using the primers described in Chang, el aL, supra, and 
other primers derived directly from the sequence of the human fragments. Ready-made 

25 libraries containing hh sequences are being screened directly and, where necessary, new 
libraries are being constructed by standard methods from RNA sources containing hh 
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sequences. The probe for these screens is a mixnire of all the distinct human hh 
fiagments. Sequences of cDNA clones can then be determined. Most clones containing 
the probe sequences, which are located in the N region, will also include a full C coding 
r^on snce standard methods of library construction result in cDNA clones that are most 
complete at their 3' ends. All fall length Mt-coding sequences obtained previously in 
vertebrates and invertebrates contain N and C sequences encoded in a single RNA. 
Screening is continued until complete open reading frames that correspond to all of the 
fragments of human hh genes are obtained. Specifically, 1.2x10*^ clones from a human 
fetal brain library (Stratagene, La Jolla, CA) was screened using a mixture of the two 
human hh fragments (HGURE llA and B) as probes. Twenty-nine clones were 
identified as specifically hybridizing with these probes. 

Second, the RNA sources identified as containing hh sequences can be used as templates 
from anchored PGR (also referred to in the literature as RACE, for rapid amplification 
of cDNA ends). Briefly, this method provides a means to isolate farther mRNA sequence 
in either the 5' or 3' direction provided that sequence is known firom an internal starting 
point. Anchored PGR can also be used to isolate sequences from cDNA library. 

Third, genomic libraries can be screened with the probes described in the first technique. 
Where necessary, human hh exons and coding sequences are being identified by 
hybridization to previously isolated human and mouse coding sequences by sequence 
determination, and by exon-trapping methods to identify all hh coding sequences within 
genomic clones; these coding sequences can be ''stitched" together by standard 
recombinant DNA methods to generate complete hh open reading frames. 

FIGURE 12 A and B show in vitro cleavage reactions of a Drosophila hh protein 
produced in E. coli and purified to homogeneity. This protein has residues 89-254 
deleted, rendering it more soluble and easier to purify. It also contains a Hise purification 
tag appended to the N-terminus. Autoproteolysis of this protein is triggered by the 
addition of reducing agents (DTT), and the resulting product corresponds to the C 
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fragment identified in vivo. FIGURE 12, Panel A shows a time course of cleavage after 
initiation by addition of DTT. Panel B shows incubations of concentrations ranging over 
three order of magnitude for a fixed time period (four hours), with no difference in the 
extent of conversion to the cleaved form. This concentration-independent rate of 

5 cleavage indicates an intramolecular mechanism of cleavage. Panel C shows the 
sequence around the cleavage site as determined by amino-terminal sequence of the 
cleaved fragment C. The cleavage site is denoted by the arrow, and the actual residues 
sequenced by Edman degradation of the C fi-agment are underlined. Panel C also shows 
an alignment of all published vertebrate hh sequences plus some of unpublished 

10 sequences from fish and Xenopus. The sequences shown correspond to the region of 
. Drosophila hh v*ere tiie cleavage occurs, and demonstrates the absolute conservation of 
the Gly-Cys-Phe sequence at the site of cleavage. Panel D shows a SDS-PAGE gel 
loaded with in vitro transcription/translation reactions as described in the previous 
Examples, using various hh genes as templates, dhh is Drosophila, twhh and z£shh are 

15 the twiggy-winkle and sonic hh genes of the zebrafish, and mshh is the shh/Hgh'l/vhh-l 
gene of the mouse. The translation mix included '^S-labelled cysteine, used to visualize 
the resulting products by autoradiography. Note that each gene give a larger product (the 
precursor or .U) and two smaller products of cleavage (N and C). The larger species is 
C for each of the vertebrate genes, whereas the Drosophila N is larger than C due to the 

20 presence of -60 residues occurring amino-terminal to the signal sequence that are present 
in the vertebrate open reading fi^e. This panel shows that vertebrate hh proteins are 
processed similarly to the Drosophila protein. Panel E shows that Edman degradation 
of the C Augments releases "S counts on the first but not subsequent rounds for all these 
proteins, indicating that the site of autoproteolytic cleavage for all of these hh proteins 

25 is the amide bond to the amino-terminal side of the Cys residue that forms the center of 
the conserved Gy-Cys-Phe sequence highlighted in panel C. This is a generalizable 
approach to establish the compo^tion of protein fragments from any other hh family 
members. 
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FXAMPLEU 

MFsnnFWM AND ]m mmfk^ PPOCTNITORS. 

Partial sequences corresponding to five distinct zebnifish hh-Vke genes were isolated and 
the complete coding sequences for two of these genes were obtained from an embiyonic 
cDNA Hbrary. One of these two sequences is identical to that of the zebrafish nhh-I gene 
(RoeBnk, etal.. Cell, 76:761. 1994), and appears to correspond to the shh gene reported 
by Krauss. et al.. (Cell, 251431, 1993) (See FIGURE 13 description); the other gene. 
tiggywinkle (Potter, B.. The Tale of Mrs. Tiggy-Winkle, The Penguin Group, London, 
1905), represents a novel vertebrate hh. Coding sequences for both are shown in 
afignment to mouse and chicken sequences of the somcA^hh-J class (FIGURE I3b). Like 
other vertebrate hh homologues, the twhh and shh proteins contain an araino-terminal 
stretch of hydrophobic residues. These residues function as signal sequences since 
cleavage is observed when coding sequences are translated in the presence of micro- 
somoses; vertebrate hh genes thus appear to encode secreted proteins, as previously 
reported for Drosophila hh (Kimmel C.B. & Warga, R.M.. Developmental Biology, 
- 124:269-280, 1987; Warge, R.M., & Kimmel, C.B., Development, lflS:569-580, 1990). 

The first four sequences were isolated from zebrafish genomic DNA (a gift from J. 
PeUegrino) using degenerate primers in polymerase chain reactions as described (Chang, 
etal.. supra), twhh and shh clones were isoUted from a 20-28 hour cDNA Ubrary (a gift 
from R. Riggleman, K. Helde, D. Giunwald and J. PeUegrino) using the first three 
sequences as probes. The translational reading fiiunes for twhh and shh were closed 12 
and 16 codons, respectively, upstream of the putative initiating methionine. 

Figure 13 shows the predicted amino acid sequences are shown in single letter code. 
13(a) shows sequences common to five distinct «»-like genes are shown with a dot 
indicating identity with the corresponding residue of zebrafish twiggywinkle (twhh. 
Potter 1905; supra), hh[zfB) and hh[zfC] is more diverged and appears to represem a 
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novel class. 13(b) shows amino add sequences of twhh and shh are aligned to those of 
the soniclvhh'l class from chick and mouse (Riddle, et al,. Cell, 25:1401-1416, 1993; 
Chang, D.T., e/o/.. Development, supra\ Echelard, Y., et al.. Cell, 7S: 143 1-1444, 1993). 
Zebrafish sonic hedgehog (shh) is identical in sequence to r-vWi-/ reported by Roelink, 

5 et al. Cell, 76761-775, 1994. Based on expression and extensive sequence identity 
throughout most of the coding region, v/i/r-/ and the sonic sequence reported here 
probably correspond to shh of Krauss, et al. Cell, 211431-1444, 1993, diverges 
dramatically throughout a 26 residue stretch near the carboxy-terminus. Rat vhh-I/sonic 
hh (Roelink, et al, susii ) was excluded in this alignment because of its 97% sequence 

10 identity to the predicted mouse protein. Residues identical in all four sequences are 
boxed, and a dash indicates a gap in the alignment. The arrow indicates the predicted 
signal sequence deavage site (von Hdjine, G. , Nucleic Acids Res, , 14, 4683-4690, 1 986) 
for twhh. The amino-terminal hydrophobic stretch conunon to all four hh genes is 
shaded. 13(c) shows percent identity of residues carboxy-terminal to the hydrophobic 

15 region. 

Figure 14 shows a comparative expression of twhh, shh, and pax-2 during zebrafish 
embryogenesis. Whole mount in situ hybridizations on 0-36 hour embryos were 
performed using a modification of the procedure of Tautz and Pfeifle, Chronosoma, 
28:81-85, 1989, with antisense probes. Transcript localization is revealed by the purple 

20 product of an alkaline phosphatase enzymatic reaction. Staging of the embryos is 
according to Westerfield, M., (The Zebrafish Book, University of Oregon Press, Eugene, 
1993). Transcripts were visualized by in situ hybridization to whole embryos, (a, b) twhh 
expression in a single late shield stage embryo, (a) Dorsal view, animal pole is to the top. 
The triangular shape of expression is characteristic of axial mesoderm-forming cells of 

25 the hypoblast (Statchd, S,E., et aL, Development, 112:1261-1274, 1993). (b) Lateral 
view: the thicker layer of cells on the left (dorsal) side of the embryo is the embryonic 
shield; the two arrows indicate the fw^M-expressing hypoblast cells and the non- 
expresang epiblast. Anterior is to the left in all subsequent embryos. Dorsal is to the top 
in all lateral views, (c, d) A single embryo at the end of gastrulation (100% epiboly) 
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with /ivM-expressing cells, (d) Caudal-dorsal view. Note the wide patch of stain in the 
presumptive tailbud which narrows anteriorly, (e, j) Early soniitogenesis (1 1 S hour, 3-4 
somite) embryos; optic veacles have not b^n to evaginate from the wall of the 
diencephalon. (e, h, k) Lateral views of developing brain, (f, i, 1) Dorsal views of 
developing brain, (e, f, g) Localization of Av/z/r-expressing in a single row of cells that 
will form the flood plate. The arrowhead marks a parch of rwAA-expressing cells lateral 
to the tailbud. (h, i, j) Localization of shh, shh is also expressed strongly in the 
protuberance. Q) Lateral view of the developing tail, shh is also expressed strongly in 
the protuberance, (j) Lateral view of developing tail, shh is expressed in cells that will 
form both floor plate and notochord. (k, 1, m) Localization of pcDc-2 during early optic 
vesicle formation; (m) also shows iwhh expression, (k) 12 hour (4-5 somites) embryo. 
(1) 12.S hour (S-6 somites) embryo. Expression ofpax-I in the developing optic vesicle 
is in a gradient away from the protuberance. Note the expression of pax-2 (asterisk) at 
the future midbrain-hindbrain border, (m) twhh (arrow) and par-2 expression in a 6-7 
somite (13 hour) stage embryo. Note differential expression of twhh in ventral neural 
keel (corresponding to neural tube in other vertebrates), (n-s) Embryos at end of somito- 
genesis (22-24 hours), (n, o, p) Localization of /m^/tA. (n, o) Developing brain. Note 
isolated groups of cells staining in the diencephalon (filled triangles) and the 
protuberance (arrowhead), and floor plate expression underlying the midbrain and 
hindbrain. The floor plate expression is contiguous caudally along the axis, (n) Lateral 
view, (o) Dorsal view, (p) Lateral view of tail. Expression is restricted to the floor plate 
(q, r, s) Localization of shh, (q, r) Developing brain (q) Lateral view. pcDc-2 expression 
in the otic vesicle is indicated, (r) Dorsal view. Expression in the protuberance 
(aiTowhead) and in the neural ked. (s)Lateral view of tail. Expression is strongest in the 
floor plate, but contrary to the report of Krauss, ei al„ aipm ^ is still also in the notochord. 
Abbreviations: white e - epiblast; h - hypoblast; tb - tailbud; p - protuberance; c - eye; 
ov - optic vesicle; ot - otic vesicle; fp - floor plate; nc - notochord; asterisk - midbrain- 
hindbrain boundary or par-2-labeled prospective midbrain-hindbrain boundary; t - 
telencephalon. 
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Comparison of twhh and shh expression patterns (Krauss, ei al, supra), reveals that both 
gene are predominantly expressed in midline stnictures, albeit with notable differences 
in regard to timing, rostra-caudal extent, and tissue restriction. Expression of twhh is first 
detected during gastrulation in the dorsal mesoderm (FIGURE 14a, b); this expression 
occurs in a band corresponding to a subset of the embryonic shield, a structure, analogous 
to Spemann's organizer in Xenopus {Slzchel, ei aL, Dev., ill: 1261-1274, 1993, and 
reference therein; Ho,, R., Seminars in Developmental Biology, pg.3, 1992). In concert 
with the movements of convergence and extension, this band of twhh expression shortens 
along the equatorial plane and extends along the incipient embryonic axis until, by the 
end of gastrulation, expression occurs throughout the entire axis (FIGURE 14c,d). Early 
in somitogenesis, twhh RNA is found restricted to presumptive ventral neural tissue along 
the entire body (FIGURE 14e, I g), the only exception being cells in and near the tailbud 
(FIGURE 14g). In contrast to the neural restriction of twhh, shh is localized both to 
presumptive neural and notochordal cells (FIGURE 14j). 

As somitogenesis proceeds, ventral midline expression of shh and twhh is reduced in most 
of the prospective forebrain, but renuuns strong in an anterior patch of midline cells 
within the floor of the prospective diencephalon (FIGURES 14e, f, for twhh; FIGURES 
h, i for shh). This patch later will give rise to the protuberance (Schmitt, E.A. and 
Dowling, J.D., y. Camp. Neur, 244: 532-542, 1994), an anterior extension of the 
diencephalon. This structure, which is medial and just rostral to the developing optic 
stalks, is the site we propose as the focus of early patterning activity for the developing 
eyes (see below). By the end of somitogenesis, both twhh and shh are strongly expressed 
in the floor plate (FIGURES 14p, s), although shh transcripts remain detectable in the 
notochord at this stage and at 36 hours of development (FIGURES 14s; later stage not 
shown). At 28 hours, twhh transcripts are also found in a small cluster of cells within the 
first gill arch (not shown), as also reported for shh at 33 hours of development (Krauss, 
et ai, supra). 
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Differraces between twhh and sWr expression are apparent from the beginning of 
gastnilation, since twhh RNA can be detected as early as the shield stage ^e is first 
detected later, at about 60"/o qnboly (not Aavm, (Krauss, et aL. supra). In addition, twhh 
transcripts are restricted to neural tissues eariy in development, and are never detected in 

5 the notochord (compare FIGXJRE 14g to HGURE 14j). Later differences in expression 
include differential rostra-caudal restriction within the diencephalon and midbrain and 
weako- and more restricted expression of twhh in the protuberance (compare FIGURES 
14n and 14q), such that the later domain of twhh expression in the brain appears to 
constitute a subset of the shh domain. In addition, shh but not twhh is expressed in the 

10 developing fin bud (Krauss, et ai, supra). Comparison of shh and twhh expression 
patterns to this previously reported for hh homologues in zebrafish and other vertebrate 
species indicates that shh is the zebrafish homologue of the sonic/vhh-1 class while twhh 
represents a novel class of vertebrate hh. 

15 DEVELOPMENTAL CONSEOIIFNCES OF ECTOPI C 

RYPRESSION mmiNr, ZEBRAFISH EMBRYOGENESIS 

To gain insight into the potential roles ofhh products in development, synthetic twhh and 
shh mRNA was injected into 1-8 cell embryos. This technique yields a mosaic but fairly 
uniform pattern of expression, as determined for the control mRNA encoding P- 

20 galactosidase (not shown). Uniformity of expression is in good agreement with fate 
mapping studies of the early zebrafish embryo (Kimmel & Warga, supra\ Warga & 
Kimmel, supra\ Heide, et aL Science, 265:517-520, 1994), which indicate that 
blastomeres undergo extensive cell mbdng during the cleavages prior to gastnilation. We 
note that mosaicism of expression caused surprisingly little variation in the phenotypes 

25 of the M injected embryos, possibly due to secretion of hh gene products. 

Embryos injected with synthetic twhh or shh mRNA {hh RNA) exhibited numerous yet 
highly reproducible abnormalities in comparison to control embryos injected with lacZ 
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mRNA. These abnormalities, discussed below, are primarily defects in the brain and 
eyes. Although the efifects of ectopic twhh and Mi expression were qualitatively simUar, 
the incidence and severity were greater with twhh RNA (see text below, FIGURE 1 S and 
FIGURE 16). The proteins encoded by these two genes have qualitatively similar 
5 biological activities, but apparent differences in potency. 

FIGURE 15 shows the effects of ectopic hh on zebrafish development. Wild type 
zebrafish, Danio rerio, Ekkwill Waterlife Resources) were maintained at 28.5*C, some 
embryos were then cultured overnight at RT, Zebrafish embryos were injected at the 1 -8 
cell stage with twhh, shh, or /acZRNA and examined at 28 h of development, (a-c) 

10 Dorsal view of the midbrain-hindbrain region; anterior is left, (a) lacZ. (b) twhh (c) shh 
(d-f) Frontal optical section of tfie forebrain region; anterior is up. (d) /acZ, (h) twhh, (f) 
shk (g-1) Lateral view of the eye region; anterior is left, (g) lacZ. (h) twhh, (i) twhh. At 
levels caudal to the prospective brain, the notochord, somites, and neural keel formed by 
most /i/r-injected embryos appeared grossly normal except for an overall shortening and 

15 dorsal curvature of the axis. A minority of /r/i-injected embryos (15% are not shown) 
displayed partially bifiircated axes, containing duplicated axial mesoderm and parallel 
neural keels, each neural keel comprising ventral midline cells and some bilaterally 
symmetric lateral cells (not shown). Although we have not determined the primary cause 
of these axial defects, analysis of late gastrulation stage embryos suggests hat the 

20 bifurcation may result fi-om difficulties in epiboly and convergence. Abbreviations: mv - 
mesencaphalic ventricle; rv - rhombencephalic rentricle; asterisk - midbrain-hindbrain 
boundary; ot - otic vesicle; tv • third (diencephalic) ventricle; r - retina or retina-like 
structure; 1 - lens or lens-like structure; pe - pigmented retinal epithelium. 

Morphological defects in the brain and other rostral neural derivatives occur at high 
25 frequency in /j/i-injected embryos. The three ventricles of the fish brain normally 
apparent at 28 hours of development - the rhombencephalic. mesencephalic (FIGURE 
15aX and diencephalic (tfiird ventricle; FIGURE 1 5d) - are not formed in the brains of hh 
injectees (FIGURES 15b, c; FIGURES 15e, f)), despite the obvious presence of a lumen. 
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The prominent construction normaHy present at the midbrain-hindbrain boundary also is 
absent (compare FIGURE 15a to FIGURES 15b. c). Formation of this constriction 
requires function of pax.2 (Krauss. et oL. Atoure. 252:267-270, 1991; Krauss. et al.. 
Nature, 26(i:87-89, 1992). which norroaUy is expressed in a band at the midbrain- 
hindbrain boundary (Krauss. et al., supra, Krauss. et al. Development, 112:1 193-1206. 
1991) pax2 expression at this boundary is not disrupted by /rA RNA injection, however, 
indicating tiiat tiiis phenotype does not resuh from disruption of rostra-caudal 
information. 

Defects in eye development also occur at high frequency in embryos injected with hh 
RNA. Thus. whUe at 28 hours the normal zebrafish eye has a lens and a retina with 
pigmemed epitheUum (FIGURE I5d. g). Wi-injected embryos usually fail to develop 
lenses and retinal pigmentation (HGURE 15e, h). Eye dupUcations are also observed at 
low frequencies OFIGURE 15i). The poorly developed eyes do not appear to result from 
a simple delay in development since pigmentation elsewhere in injected embryos appears 
in its normal time course. Examined at three days of development, tiie consequences of 
hh RNA injection inchide defects tiiat range from complete absence of eyes to partially 
formed eyes ladng a ventral portion of the retina. 

The eye phenotypes caused by hh RNA injection resemble those produced by treatment 
of zebrafish and Xenopus laevis embryos with retinoic acid. In Xenopus, phenotypes 
range from reduction of tiie eye and absence of tiie lends to eyes with retinal folds 
(resembling duplicated dyes) and multiple small lenses (Manns. M. & Fritzsch. B.. 
Neurosci. Lett., 122 150-154, 1991) In zebrafish. exposure to retinoic acid during 
gastiulation interferes witi) tiie formation of the eye (Holder. N. & Hill. J., Development, 
112:1 159-1 170. 1991). while exposure during formation of the optic primordia induces 
formation of duplicated retinas and extra lenses (Hyatt, et al., Proc. Natl. Acad Sci. USA, 
59:8293-8297. 1992). Patterning effects of retinoic acid upon tiie developing chick limb 
appear to be mediated tiirough ectopic activation of tiie endogenous sonic hh gene 
(Riddle, et al.. supra), tiiese results witii ectopic hh expression suggest tiie possibUity of 
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a similar mechanism underlying the patterning effects of retinoid acid treatment in the 
vertebrate eye. 

EMMTLE P 

hh EXPRESSION IN THE OPTIC VESICL E SPECIFIES 
5 PROXIMAL FATES AT THE EXPENSE OF DISTAL FATES 

To further elucidate the role of hh in eye development we utilized /xjx-2 and pax-6 
(Krauss. etal, EMBOX, lfl:3609-3619, 1991; Pitischel, etaL, Development, 114 643- 
651, 1992) were utilized as positional markers to examine the effects of ectopic hh 
expression on the optic vesicle. As the optic vesicle evaginates from the lateral walls of 

10 the zebrafish forebrain (Schmitt, E.A. & Dowling, J.D., J. Comp, Neur., 344:532-542- 
1994), pax^2 is expressed in a gradient, with highest RNA levels in the anterior and 
ventral regions of the optic vesicle (Krauss, et aL, supra, FIGURE 14k, 1, m). 
Immediately adjacent to the maximum of this pax-2 expression gradient is the region of 
the dioiecphalon tenned the protuberance (Schmitt & Dowling, supra), where both twhh 

15 and shh but not ptax-2 are strongly expressed (FIGURES 14e, f, h, i, m). The 
concentration gradient of pax-2 expression in the eptic vesicle thus appears to incline 
downward from its maximum at a location adjacent to the site of twhh and shh expression 
in the protuberance. Superposition of developmental fate within the optic vesicle 
(Schmitt, et aL, supra), upon the pattern of pax'2 expression suggests that the gradient 

20 of paX'2 RNA prefigures the future proximal/distal axis of the eye. 

Ectopic hh alters the expression of paX'2, pax-6, and F-spondin. Zebrafish embryos were 
injected at the 1 -8 cell stage with twhh or shh RNA and the pattern of pax-2, pax-6, or F- 
spondin expression was examined by whole mount in situ hybridization. Control 
embryos injected with lacZ RNA were performed in every case and displayed wild-type 
25 expression patterns. At embryo stage, the anterior-posterior axis of the optic vesicle 
corresponds to the future proximal-distal axis of the eye. During the next hour of 
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devdopment, the posterior edge of the optic vesicle will separate from the diencephalon 
(Schmitt and Dowling, Comp. Neur., 244:532-542, 1994). 

Injection of dther hh RNA causes umform initiation of pcDc-2 repression along both the 
proximal-distal and dorsal-ventral axes of the optic vesicle as it begins to evaginate. The 
ectopic pax'2 expression appears at the same time as normal pax'2 expression is initiated 
in the eye, and in some cases, is also seen in the diencephalon between the optic vesicles 
At the end of somitogenesis, a time when pax-2 would nonnally be restricted to the optic 
stalk, paX'2 RNA in hh injected embryos is detected in all but the most distal portion of 
the optic vesicle. 

the effects of ectopic hh on expression of por-tf, which encodes a transcription factor 
critical for eye development was also studied. At 22 hours of zebrafish development, 
pax'6 is normally expressed in the lens and in most of the distal part of the optic cup 
(Krauss, etal., stqira, Puschel, et aL, Development, 114:643-651, 1992). In AA-injected 
embryos, /mx-5 is repressed in the optic vesicle, although many embryos retain pax-6 
expression in the most distal cells. With regard to paX'2 and pax'6 as markers of 
positional identity, hh expression in the optic vesicle can be characterized as inducing 
proximal fates and repressing distal tales. 

The distal part of the optic vesicle is the most refractory to Wi-induced changes in both 
pcDc-2 and pax'6 gene expression. Due to a later rotation, this distal portion of the optic 
vesicle will give rise to the dorsal portion of the mature eye (Schmitt, ei aL, supra), 
interestingly, this is the portion of the eye that remains in 3 -day old injected embryos with 
intermediate phenotypes (see above). 

Lesions in the paX'6 gene have been assigned as the basis for the Aniridia (Ton, ei aL, 
Cell, 67:1059-1074, 1991; Glascr, etal,, Nat. Genetics, 2:232-239, 1992), Small eye (Hill, 
ei aL, Nature, 254:522-525, 1992), and eyeless mutations (Quiring, et aL, Science 
2^5:785-789, 1994), in hunums, mice and Drosophila, respectively; por-tf fiinction thus 
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appears to be critically required for eye development in Drosophila and mammals. As 
we argue here, /r/»-encoded activities also appear to play a role in vertebrate eye 
development, and this suggests a further molecular parallel between vertd>rates and 
insects, since the role oihh in Drosophila eye development is well established (Mohler, 

s el al,. supra, M&,etal., supra, Heberiein, et al. , supra, Lee, et ai, supra). The reciprocal 
and non-overlapping patterns ofhh and pax-6 expression in the developing Drosophila 
eye (Ma, et ai, supra. Quiring, et ai. Science, 26^:785-789, 1994), suggest the 
possibility oi pax-6 repression by hh, but whether hh functions by similar mechanisms 
in vertebrate and Drosophila eye development is a questions that requires further 

10 investigation. 

In mice, the dosage of pax-6 protein is crucial for normal eye development (Hill, et ai, 
supra). Small eye heterozygotes develop an abnormally snuai lens (Hogan, et ai. J. 
EmbryoL Exp. Morph. 22:95-110, 1986; Hogan, et al.. Development, 1Q2 Suppl.. 1 15- 
1 19, 1988), as do /kfr-irijected embiyos with weaker phenotypes (FIGURE 14f). Small eye 
IS homozygotes lacking lenses eventually generate and the animals lade eyes at birth 
(Hogan, et al.. supra, Hogan, et ai, sigjra), as do many of the hh-irgecxed embryos at 
three days of development. These parallels suggest that many of the later eye defects 
observed in /lA-injected zdnafish may be caused by partial or complete repression of pax- 
6 during eye development. 

20 EXAMPLE 14 

r.FNFnr ablation o f hh forebrain EXPRESSION CAVSES LOSS QF 

y^^lXIMAI. FATES IN THT, OPTfC VESICLE 

The patterns of twhh and shh expression (FIGURE 14) and the efferts of ectopic hh 
expression (FIGURE 15) are (Consistent with a normal role for shh and twhh in eye 
25 development. If M activities indeed play a normal role in promoting proximal fates 
within the developing eye, removal of Wr activities would be expected to result in a loss 
of proximal fetes. In embryos homozygous for the cyclops mutation ventral neural 
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structures feil to forai and the developing eyes fuse at the midline,, yielding an embryo 
with a single eye (Hatta. et al.. Nature, 252:339-341. 1991). The missing ventral 
structures in cyclops mutants include the regions where we observe expression of twhh 
and shh, and we therefore examined the effects of the cyclqps mutation on hh expression. 

5 eye*" (Hatta, et a!.. Nature, 15fi:339-341. 1991), heterozygous adults (a kmd gift of R. 
Riggleman) were spawned and their offspring analyzed by whole mount in situ 
hybridization. Detection of pax-2 and either twhh or shh RNAs in embryos homozygous 
for the eye mutation or their wild-type sittings, twhh RNA is only expressed in the 
presumptive tailbud (caret) of eye embryos. As reported by Krauss, et al., Cell, supra, 

10 neural expression of shh is abolished in eye embryos. Strong pax-2 expression was 
observed in the optic vesicles of wild-type embryos which is significantly reduced in eye 
mutant embryos. 

twhh RNA in cyclops embryos is found only in a small patch of cells at the presumptive 
tailbud and neural expression was not detected at any later stage examined. Neural 
ts expression of shh is also lost in eye mutants, although expression in the notochord is 
rainited (Krauss, et al.. supra, data not shown). 

Since the eye mutation appears to ablate A/»-expressing cells in the developing brain, this 
mutation can be used as a genetic tool to examine tiie requirement for hh function in eye 
development. Datta, et al.; Hatta, et al.. Proe. Natl. Acad Sei. USA, 21:2061-2065, 

20 1 994), recently demonstrated that pax-6 expression is fused at the midline due to loss of 
ventral midUne cells that normally do not express pax-6 and, in addition, pax-2 
expression in the fiised eye of eye mutart embryos is reduced. We extended these 
observations to an earlier stage when the optic vesicles first form and found that pax-2 
expression is weak and foils to extend within the vesicles in eye mutants. In conjunction 

2S with the resuks of ectopic hh expression, these observations suggest that hh signaling that 
activity promotes and is required for the induction of proxima fates within the eye vesicle. 



W096n7924 



PCT/US95/1546J 



-80- 

In this model, we propose that the protuberance acts as a proximal patterning center for 
the developing zebrafish eye by providing a localized source of hh activity. 

EXAMPLE IS 

hh ACTIVITY VENTRALIZES THE DEVELOPING BRAIN 

5 Previous work has established an important role of signals from the floor plate and 
notochord in ventral patterning of the neural tube (Jessell, T.M., & Dodd, J., Cell, 69:95- 
110, 1992). For example, Goulding, et al„ Development, 117: 1001-1016, 1993, recently 
demonstrated that notochord and floor plate grafts can repress the normal lateral 
expression of pax'6 in the neural tube. Other recent work has implicated hh activity in 

10 at least some aspects of ventral neural tube patterning (Echelard, et aL, Cell, 21' 1417- 
1430, 1993; Krauss, etal., supra, Roelink, et al., supra); consequently, we examined hh- 
injected embryos for effects on expression in the brain. 

In the zebrafish at 22 hours of development, /7ar-6 is expressed in dorso-lateral regions 
of the diencephalon and in a ventro-lateral domain of the hindbrain and spinal cord that 

15 excludes the floor plate and adjacent cells (Krauss, et al, supra; Puschel, et aL, supra). 
This pattern of expression is reciprocal to that of both twhh and shh in the diencephalon 
(corhpare FIGURES 14q and 14i) and in the hindbrain. hh RNA injection caused 
repression of pax-d in the more ventral domain in the diencephalon, while more dorsal 
expression persisted. In addition, pax-6 expression was significantly reduced ventrally 

20 in rhombomeres 1, 2, and 4 and, in some cases, was completely abolished in these 
rhombomeres. The repressing effect of ectopically expressed hh and pax-6 in normal 
embryos are due to repression of pax'6 by nearby hh expressing cells. 

Since absence of pax-6 expression is a feature of the ventral midline, repression of par-6 
in lateral positions suggests ventralization. Consequently, twhh was injected into 
25 embryos for analysis of induction of a floor plate marker, F-spondin (Riddle, et aL, 
supra). As described above, ectopic twhh induces F-spondin expression at more dorsal 
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levels in the midbrain and anterior hindbrain. The eflBxts of hh upon expression of both 
pax-e and Fspondin indicate a ventralization of the brain. Adoption of ventral cell 
identity by lateral cdls might explain thar feilure to fonn ventricles (FIGURE 1 5a-f). 

The ventralizing activities of twhh confirm and extend those previously reported for 
shh/vhh-1 class genes of chicken, zebrafish, and rat (Echelard, et al., supra, Krauss, et 
ai, supra, Roelink et al.. supra). The eariy restriction of twhh to midhne neural 
progenitors, however, suggests that it may play a specific role in the homeogentic 
mechanisms of floor plate maintenance and expansion (Placzek, et ai. Dev., 112:205-218, 
1993). In the zebrafish, wild type cells in cyclops hosts can contribute to and induce 
adjacent cdls to form floor plate, but only when the transplanted cells populate the neural 
plate and not the notochord (Hatta, et al.. Nature, 3ifi:339-341. 1991). We have 
demonstrated that, in cyclops mutants, midline expression of twhh is lost whUe shh 
expression is maintained in the notochord (HGURE 18; Krauss, et al.. supra for shh), 
taken together, these results suggest that the homogenetic floor plate ngnal lost in the 
cyclops mutant may be encoded by the twhh gene. In the chick and rat, the floor plate 
retains auto-inductive potential long after the loss of floor plate inducing properties by 
the notochord, despite continued oqnession of shhAMil in tiie notochord (Roelink, et al.. 
supra, Placzek, et al, supra, Yamada, et al.. Cell, 73:673-686, 1993). Although no 
homologues of the twhh class have been reported in other vertebrates, expression of other 
hh homologues in patterns more like tiiose of twhh might help explain tiiese 
discrepancies. 



EXAMPLE 16 

Twn msTiNrr sign aling proteins DERIVE FROM 
THF p-i-i'-mrnnvTi PWEruRSOR 

Endogenous hh protein in Drosophila is fiind predominantly as an anuno- and a carboxy- 
terminal fiagment (N and C, respectively) derived by an internal auto-proteolytic 
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cleavage of a larger precursor (U for uncleaved), which also occurs in vivo but at lower 
levels (Lee, et al., supra), Detenninants within the amino-terminal domain appear not 
to be required for auto-proteolytic activity, whereas mutations affecting the carboxy* 
terminal domain can blodc auto-proteolysis and reduce activity in vivo (Lee, el al. 
supra). The auto-proteolym is blocked by a substitution of alanine for the histidine 
normally present at position 329. This histidine is absolutely invariant in alignments of 
all known hh genes, and its sequence context suggests a catalytic role in auto-proteolysis 
(Lee, et aL, supra), 

FIGURE 17 shows zebrafish twiggy-winkle hedgehog derivatives. 17(a) Cartoons of 
various twhh open reading frames. SS (shaded) is the predicted N-terminal signal 
sequence for secretion of these proteins and encompasses the first 27 amino acids of each 
open reading frame. The arrow indicates the predicted internal site of auto-proteolytic 
cleavage. Amino acid residue numbers are according to Figure 13b. The filled triangle 
denotes the normal termination codon for the twhh open reading frame. Construct Uha 
contains a mutation that blocks auto-proteolysis (the histidine at residue 273 is changed 
to an alanine; see Lee, J.J., et al., supra.). Construct U3S6ha contains a stop codon in 
place of amino acid residue 357 as wdl as the H273A mutation in Uha- Construct N 
encodes just the first 200 amino acids of twhh. Construct C has had the codons for 
residues 31-197 deleted. 17(b) shows in vitro translation of the expression construas 
shown schematically in part a. Constructs were translated in vitro in the presence of "S 
methionine and analyzed by autoradiography after SDS-PAGE. The protein products are 
shown schematically to the left. Lanes 1 and 6: Auto-proteolysis of the full-length (Uss) 
protein creates two fragments, an N-terminal ftagment (Nss) and a C-terminal fragment 
(C). Lane 2: Construct Uha only makes an uncleaved form of twhh protein that 
comigrates with Uss twhh via auto-deavage. Lane S: Construct C encodes processed and 
unprocessed forms which are visible as two bands migrating closely together. The 
bottom band is the C protein made from auto-proteolysis of the Uss (^3 1-197). All 
constructs were made by in vitro mutagenesis of repression construct TlTStwhh (see 
FIGURE IS) using the method of RPCR. The sequence of all constructs were confirmed 
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10 



15 



20 



25 



by dideoxy sequencing. /» vitr© translations were perfonned according to manufacture^^ 
instnictions (Prom^). 

The vertebrate hh proteins encoded by shh, twhh and moMSt^hh/Hhg^J also undergo 
auto-proteolysis to yield two smaUer species from a single larger precursor (Lee. et al. 
supra; Chang, et al, supra, see lanes 1 and 6 in HGURE 17b). The invariant histidine 
to alanine mutation to generate a constnict encoding a form of the twhh protein that is not 
auto-proteolytically cleaved (Uha). We have also introduced a nonsense codon and 
deleted a segment of coding sequence to generate constructs that produce either the 
amino- or the caiboxy-terminal domains of twhh (N and C. respectively; see lanes 4 and 
5 in FIGURE 17b); constructs are schematically diagrammed in FIGURE 1 7a). To target 
these proteins to the secretory pathway, all constructe retained the normal twhh signal 
sequence. 

Synthetic mRNAs transcribed ftora these constructs were injected to examine the role of 
processing and to assay the activities of individual protein fragments; the resuhs are 
summarizedinTaWelandarebasedontheactivitiespresentedinFIGURElS. Themost 
striking conclusion from these experiments is that N and C both exhibit activity, and that 
these activities are distinguishable. Thus, although both N and C are capable of 
ectobpicaliy activating pax-2 in the developing eye. thereby providing an internal 
injection control, only N was capable of efficiently repressing /Jar-fi (HGURE 16). Uter 
effects on lens development were also more extreme for N. consistent with the role of 
pax-6 in lens developmem suggested by its mutant phenotypes in mice. (See Ton, C.C., 
e/a/..Ce// 62:1059-1074. 1991; Glaser, T.. et al.. Nat. Genetics g232-239, 1992. Hill. 
R.E., et al.. Nature 354:522-525. 1991; Hogan. B.L.. et al.. J. Embryol. Exp. Morph, 
22:95-110. 1986; and Hogan. B.L., e/a/.. Z)eve/o;?me/J/, lCiaSuppl.:115-119. 1988.) 

In considering the activity of delta N-C. it is important to recognize the activity of 
endogenous hh genes in these experimems. which are inhibited by delta N-C and 
fragments thereof (see Example 18 and FIGURE 18 for further discussion) 
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The uncleaved Uha protein is only somewhat less active than C in inducing pcDC-2, but it 
also was. not able to repress pax6 efficiently (FIGURE 16). The latter is particularly 
notable since the Uha protein (U3^ ; see FIGURE 17a, b) has activities not 
significantly different firom N (FIGURE 16). Thus, in addition to carrying determinants 

5 important for auto-proteolysis and /MDC-2induction, the C-terminus also contains a domain 
inhibitory to N-terminal function when in the context of the uncleaved hh protein. The 
C-terminu5 can also inhibit N action by an intermolecular mechanism (Lai, et al, supra). 
The existence of such an inhibitory domain in C suggests that if autoproteolyis can be 
modulated, such modulation might regulate the activity of hh in vivo. This possibility 

10 highlights the importance of ascertaining the processed state of hh proteins expressed in 
any particular patterning center to understand the potential hh activities generated. 
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RXAMPLE17 

miAI. ROLES OF STHNAMNG PROTEINS IN EARLY EYE 
AND RRAIN PATTERNING 

In understanding the normal roles of N and C in eye and brain patterning, the N and C 
$ derivatives of the Drosophila hh gene may oflFer some inaght. The Drosophila N 
derivative is retained close to its embryonic site of synthesis in a segmentally striped 
pattern (Tabataand Romberg, Cell, 26:89-102, 1994; Taylor, et al, Mech. Dev., 42 89- 
96, 1993), is cell-associated when expressed in cultured cells, and is effectively bound 
by h^Muin agarose in vitro, suggesting the possibility of extracellular matrix association. 
10 The C-terrainal fragment, in contrast, is not bound effectively by heparin agarose, is 
almost quantitatively released into the culture supernatant of expresang cultured cells, 
and is only diffusely localized in embryos. Although the activities of individual 
fragments have not been assayed, the biochemical diflFerences and tissue distributions of 
Drosophila N and C may account for the short and long range nature of the functions 
15 assodated with hh during Drosophila devdopment. 

Although the tissue distributions of zebrafidi N and C are not known, their activities in 
ectopic expression assays are also suggestive of short- and long-range &nctions when 
considered in the comext of normal expresaon patterns of hh, pax-2 and pax-6. The 
normal gradient of pax-2 expression in the optic vesicle extends a substantial distance 

20 from its maximum adjacent to the site oihh expression in the protuberance; the ability 
of ectopic C to activate pax-2 therefore suggests that, consistent with the distribution of 
C in Drosophila, zebrafish C n»y cany out a long-range function. Repression of 
endogenous pax-6 expresaon, in contrast, appears to be a short-range function since pax- 
6 expresaon occurs close to endogenous hh expression. EflBcient repression of pax-6 is 

25 an attribute of constructs producing N, and a short-range function for N would be 
consiaent with the diaribution of N in Drosophila. 
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Two types of M-dependent activity have been reported for A/i-transfected cultured cells. 
One is the apparent contact-dependent induction of floor plate markers (Roelink, H., el 
al. Cell 26:76U775, 1994); the second induction of sclerotome markers in presomitic 
mesoderm, is difiEusible and acts at long-range. 

5 EXAMPLE 18 

CHARACTERIZATION OF XENOPUS hh 



1 . Materials and Methods 

cDNAs encoding full-length Xnopus hedgehogs, or encoding amino terminal or carboxy 
10 terminal domains linked to secretory leader sequences were transcribed in vitro to yield 
translatable messenger RNA The synthetic messenger RN As, and control mRN As, were 
microinjected into the animal poles of cleavi^e stage Xenopus embryos, which were 
allowed to develop to the blastula stage, at which time the animal cap explants were 
prepared from the upper one fourth of the embryo. These blastula cap explants were then 
15 cultured in vitro in physiological saline in the presence or absence of the transforming 
growth factor beta family member, recombinant human activin A. All explants were 
allowed to develop until control embryos had grown to neurula stage, or to tadpole stage. 
Importantly, blastula caps left untreated differentiate from ectoderm into atypical 
epidermis. Blastula caps treated with activin differentiate into mesodermal and neural 
20 cell types. Thus, the question was whether hedgdiog, or its proteolytic derivatives, would 
change the diflFerentiation of cells away from becoming epidermis, and into another cell 
type. A secoiid question was whether hedgehog can work with activin to alter the normal 
response of the tissue to either factor by itself 

Explants were then extracted to yield mRNA by methods commonly used by those of 
25 skill in the art, which was used as template with reverse transcriptase to yield cDNA. The 
cDNA was then used as template with various sets of primers for PCR for specific genes, 
reverse-transcriptase-polymerase chain reaction, or RT-PCR. This results in specific 
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amplification of radioactive products i^ch are diagnostic for the presence and level of 
the messenger RNAs which were present in the explants. Samples were separated on 
polyaciylamide gels, which were exposed to X*ray film to yield the bands shown in the 
figures. Thus, the darker bands correspond to a greater level of the specific mRNA. 

5 FIGURE ISA and B demonstrate that hedgehog induces pituitary and anterior brain 
genes, and can cooperate with activin or with neural inducers such as noggin and 
foUistatin which are induced by activin to elevate expression of these genes in explanted 
embryonic tissue. All odd numbered lanes lack reverse transcriptase in the RT-PCR 
reaction and are negative controls. All even numbered lanes have this enzyme, and thus 

10 give specific bands to mRNA In Panel A, Lanes 1-2 are control blastula caps, lanes 3-4 
are Xenopus hedgehog-expressing blastula caps, lanes S-6 are control blastula caps 
treated with activin, lanes 7-8 are hedgehog-expressing blastula caps treated with activin, 
and 9-10 are prolactin-expressing blastula caps treated with activin to serve as a control 
for simply expressing a secreted protein in the blastula cap. The primers used for the 

15 assay are shown to the left of each panel, XAG 1 is a cement gland marker, XANFl B 
is a pituitary marko*, otx-A is an anterior brain marker, en-2 is a midbrain-hindbrain 
boundary marker, krox 20 is a rhombomere-specific hindbrain marker, HIHbox 6 is a 
posterior hindbrain marker, NCAM is a general neural marker, activin is a control for 
mesoderm induction, and elongation factor is a positive control to shown that all even 

20 numbered lanes did in fact have cDNA present. 

The panel labelled XANFIB detects a pituitary gene. Lane 4 (panel A) shows that 
hedgehog induces this pituitary marker, and thus likely pituitary cell types, in blastula cap 
explants (see also FIGURE 20, lane 6, for a stronger signal showing this), when 
compared to control explants in the absence of hedgehog (lane 2), which do not express 
25 this gene. Lane 6 shows that explants treated with activin, in the absence of hedgehog, 
also express the pituitary gene. Lane 8 shows that explants treated with both hedgehog, 
and with activin, give highest levels of the pituitary gene. Lane 10 proves that this effect 
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of hedgehog is specific, since prolactin, another secreted protein, does not lead to this 
elevated level of pituitary gene. 

The panel labelled OTX-A detects this anterior brain gene. Lane 4 (and 6 in Figure 20) 
shows that hedgehog can induce this neural-specific gene. Lane 8 shows that the level 
5 of this neural gene is highest in tissue treated with both activin and hedgehog, relative to 
hedgehog alone (lane 4), or activin along (lane 6), and control explants do not express this 
gene Qmc 2). Again, this effect is specific to hedgehog, since prolaain (lane 10) did not 
lead to elevated expression of this gene. The panel labelled XAG-1 detects a cement 
gland-specific gene, and lane 4 shows that hedgehog induces this gene at high level. 

10 In panel 1 8B, embryos wei^ injected with N or AN-C, and some animal cap explants were 
treated with activin before culturing until sibling embryos reached tailbud stage. Lanes 
1, 2: control animal caps from uninjected embryos. Lanes 3, 4: control animal caps fi-om 
uninjected embryos, treated with activin. Lanes 5, 6: animal caps fi-om embryos injected 
with N and treated with activin. Lanes 7, 8: animal caps fi-om embryos injected with AN- 

15 C and treated with activin. Whereas N displays activities in activin-treated explants 
similar to those of X-bhh (see B) AN-C produces the opposite effect, decreasing anterior 
and increasing posterior neural marker expression. As shown in Figure 18B, N behaves 
like X-bhh in that it induces elevated levels of XANF.2 and Otx-A (lane 6) relative to 
control activin-treated animal caps (lane 4). Moreover, N also leads to a decrease in the 

20 expression of more posterior markers, such as krox-20 and X/H6ox-6, as observed 
following injection of X-bhh In contrast to the activity of N (Fig. 4C, lane 6), AN-C 
decreases the expression of the anterior neural genes XANF-2 or Otx-A (Fig. 4C, lane 
8) in activin-treated animal caps when compared to uninjected controls (lane 4), 
Moreover, AN-C also leads to an increase in the expression of more posterior markers, 

25 such as £>i-2 and Xlhbox'6. 

FIGURE 19 shows X-bhh modifies the anteroposterior pattern of neural gene expression 
in explants under the influence of endogenous neural inducers. (A) Isolation of dorsal 
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explants from injected embryos for the preparation of Keller sandwiches (Keller and 
Danilchik, 1988; Doniach, ei aL, 1992; redrawn from Doniach, 1993). (B) KeUer 
sandwiches were made from uninjected O^nes 1 and 2) and X-6/i^injected (lanes 3 and 
4) embroys, total RNA was isolated i^^.control embryos reached stage 20, and RT-PCR 

5 was used to analyze the expression of XAG-1 and neural markers. XAG^l is a cement 
gland marker, XANF-2 is an anterior pituitary marker, Otx-A is a forebrain marker, En-2 
demarcates the midbrain-hindbrain boundary, Krox-20 marks rhombomeres 3 and 5 of 
the hindbrain and XIHboX'6 is a spinal cord marker. N-CAM is a general neural marker 
whose expression is not restricted along the anteroposterior axis. The EF-la control 

10 demonstrates that a comparable amount of RNA was assayed in each set. Note that 
expression of XAG-1 and anterior neural markers is stimulated by X-bhh treatment, 
whereas expression of posterior neural markers is suppressed. 

FIGURE 20 demonstration of differential activities of N and C domains of hedgehog 
proteins. As in FIGURE 18 above, odd numbered lanes are negative control lanes, and 

15 positive numbered lanes show specific gene expression for the markers described above. 
The N domain of hedgehog is encoded in the construct called Xhhl208 (lane 8), and the 
C domain is encoded in the construct called Xhhldeha 27-208 (lane 10). The construct 
Xhhll-1270A Qane 12) is specifically mutated so that it is unable to undergo self- 
processing. The ability of the N and C domains to induce the genes described above is 

20 compared to control blastula cap explants (lane 4), entire embryos as a positive control 
(lane 2), blastula cap explants expressing a mutated hedgehog as a negative control (lane 
14), blastula caps expressing the entire hedgehog 1 Gane 6), and blastula cap explants 
treated with an independent neural inducer, noggin (lane 16) (discovered by Richard 
Harland at University of California at Berkeley). 

25 Examining the first panel for the cement gland marker XAG-1 clearly shows that intact 
hedgehog (lane 6) and the N domain (lane 8) and the processing defective hedgehog Oane 
12) are much better than inducing the cement gland than is the C domain (lane 1). 
Examining the second panel demonstrates that the C domain (lane 10) is better at 
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inducing the pituitary gene XANFIB than is the N doniain Oane 8). Since the N domain 
induces the XAG-1 marker better, described in point A above, the two results together 
clearly demonstrate that the N and C donuuns have distinguishable activities. 
Examination of the remaining panels shows that all described activities of the normal 
hedgehog (lane 6 ) can be defined in terms of the activities of the N and C domain. 

Examining the third panel, for the forebrain gene otx-A, shows that both the N domain 
(lane 8) and C domain (lane 10) induce similar levels of this gene, but the processing 
defective hedgehog (lane 12) is better than either at inducing this gene. 

Examining the fourth panel of this figure (NCAM), (as well as the FIGURE 18 panels 
EN-2, lcrox20, XIHbox6, and NCAM), shows that hedgehog does not induces these more 
posterior neural genes. Notably, noggin (lane 16) is able to induce pituitary gene and 
forebrain g«e, but it also induces the general neural gene, NCAM, which hedgehog does 
not. This clearly shows that hedgehog is a distinct activity firom the neural inducer 
noggin, and has a more restricted ability to induce neural genes. 

Experiments in the Xenopus embryo were conducted by injecting fiilHength hedgehog 
RNA, and immunoprecipitating with a C-domain specific antibody, which proves that fiill 
length hedgehog does in fact get processed in vivo in vertebrates, consistent with the data 
shown in earlier Examples in Drosophila, Thus, the ideas for the utility of detecting 
hedgehog N and C domains is based on knowledge that such domains do appear through 
hedgehog processing in vertebrates. Moreover, the knowledge that hedgehog processing 
does occur in vivo naturally raised the question of whether the resulting N and C domains 
have independent activity. 

The resuhs in FIGURE 18 are novel insofiu- as they establish that the activity of hedgehog 
in inducing a pituitary gene, and an anterior brain gene, may be enhanced by the TGFP 
family of growth factors. This enhancement likely applies to the N and C domains 
described in FIGURE 20, since the genes analyzed are the same. This enhancement is due 
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to hh synergizing with neural inducing fectors which are themselves induced by TGF-p 
{amily members, including but not limited to such molecules as noggin and foUistatin. 

The data in FIGURE 20 makes several important points. First, the data show that the N 
and C domains have dififerent though somewhat overlapping activities, and that the N and 

5 C activities added together account for all of the observed activity of the intact hedgehog 
protein. Thus, any clinical or diagnostic uses of hedgehog might be improved by use of 
the N or C domain, as one generally wishes to use the smallest protein which has an 
activity for clinical work, as it is less likely to evoke adverse immune responses, or other 
adverse side effects. Second, the data show that the C domain is better than the N domain 

10 in inducing pituitary gene e7q)ression and, since it has less induction of cement gland 
genes that intact hedgehog, or N donuun, it suggests that the C domain might be useful 
in clinical situations where one wishes to enhance the development or expression of the 
pituitary as specifically as pos^ble. As the pituitary is the source of a number of 
hormones, any treatment for enhancing pituitary cell growth and activity would ideally 

15 have as few side effects as possible, and the C domain is thus a viable candidate for 
therapies with enhanced pituitary cell growth and function in mind. Third, relating to 
studies regarding noggin, FIGURE 20 shows clearly that while both hedgehog and noggin 
can induce pituitary gene expresdon, hedgehog is more specific, since hedgehog does not 
induce the general neural marker NCAM, whereas noggin induces NCAM as well as 

20 pituitary. Fourth, the hedgehog which was mutated to prevent processing (lane 12) is as 
active as full-length and wild-type hedgehog (lane 6) in inducing pituitary gene 
expression, but the processing defective hedgehog is better at inducing the forebrain 
marker otx-A. Thus, for some clinical applications of hedgehog in inducing specific cell 
types, it is possible that the processing*defective hedgehog will be superior compared to 

25 normal hedgehog. 

FIGURE 21 shows AN-C interferes with X-bhh and N activity in animal cap explants. 
Embryos were injected with various RNAs, animal cap explants were cultured until 
sibling embryos reached tailbud (stage 25), at which time RT-PCR was used to analyze 
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the expression of the cement gland marker XAG-1 and the control RNA, EF-la. Lanes 
1, 2: control animal caps from uninjected cmbiyos. Lanes 3, 4: animal caps from embryos 
injected with both X-bhh and prolactin RNAs. Lanes 5, 6: animal caps from embryos 
ii^ected with box X-bhh and AN-C. Lanes 7, 8: animal caps from embryos injected with 
both N and protecting RNAs. Lanes 9, 10: animal caps from embryos injected with both 
N and AN-C. The N and X-bhh experiments were conducted independently and thus 
absohrte levels in lanes 3-6 should not be compared to those in lanes 7-10. Note that the 
induction of XAG-1 expression by X-bhh or N is reduced by co-injection of AN-C. 

An internal deletion of X-bhh (aN-C) blocked the activity of X-bhh and N in explants and 
reduced dorsoanterior structures in embryos. As elevated hh activity increases the 
expression of anterior neural genes, and as aN-C reduces dorsoanterior structures, these 
complementaiy data support a role for hh in neural induction and anteroposterior 
patterning. 

aN-C ddetcs amino acids 28-194 of X-bhh. The primary translation product is predicted 
to undergo signal sequence cleavage removing amino acids 1-23, and to undergo 
autoproteolysis. Based on the cleavage site in Drosophila hh (Porter, et al. Nature, 
124:363» 1995) autoproteolysis would generate a C domain of X-bhh amino acids 198- 
409, as well as a predicted seven amino acid polypeptide, representing amino acids 24-27. 
and 195-197 (Lai, et al, Development 121:2349, 1995). Analysis of the effect of aN-C 
on nwral markers was by standard: methods including Northern blot analysis and in situ 
hybridization (Lai, et al., supra, incorporated herein by reference). 

Although aN-C does not induce the cement gland marker XAG-1, it decreases the 
expression of anterior ectodermal and neural markers in activin-treated animal caps. 
Thus, aN-C has the capacity to affect neural patterning. aN-C also promotes an increase 
25 in posterior neural markers in activin-treated animal caps. Mixing aN-C with N or full 
length X-bhh at a 1 : 1 ratio led to a dramatic inhibition of the induction of cement gland 
in animal cap assays, supporting the hypothesis that aN-C interfered with X-hh. 



15 
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The above disclosure generally describes the present invention, A more complete 
understanding can be obtained by reference to the following spedfic examples which are 
provided herein for purposes of illustration only and are not intended to limit the scope 
of the invention. 
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SEQUENCE LISTING 



(I) GENERAL INFORMATION: 
(i) APPLICANT; The Johns Hop)ciiis Xftiiversity School of Medicine, et al. 
(ii) TITLE OF INVENTION: NOVEL HEDGEHOG -DERIVED POLYPEPTIDES 
5 (iii) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS.: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 
10 (C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

{vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: PCT/US95 

(B) FILING DATE: Ol-DEC-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Haile, Lisa A. 
25 (B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 07265/043WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..142 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GTG AAA CTG CGG GTG ACC GAG CCC TGG GAC GAA GAT GGC CAC CAC TCA 
48 

Val Lys Leu Arg Val Thr Glu Pro Trp Asp Glu Asp Gly His His Ser 
15 1 5 10 " 

CAO GAG TCT CTG CAC TAC GAG GGC CGC GCA GTG GAC ATC ACC ACG TCT 
96 

Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser 
20 25 30 

20 GAC CGC GAC CGC AGC AAG TAC GGC ATG CTG GCC CGC CTG GCG GTG G 

142 

Asp Arg Asp Arg Ser Lys Tyr Gly Met Leu Ala Arg Leu Ala Val 
35 40 45 

AG 

25 144 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
35 (B) LOCATION: 1. .142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTG AAG CTG CGG GTG ACC GAG GGC TGG GAC GAG GAC GGC CAC CAC TCA 
4B 

Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly His His Ser 
40 50 55 60 

GAG GAG TCC CTG CAT TAT GAG GGC CGC GCG GTG GAC ATC ACC ACA TCA 
96 

Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp He Thr Thr Ser 
65 70 75 
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GAC CGC GAC CGC AAT AAG TAT GGA CTG CTG GCG CX3C TTG GCA GTO G 

Asp Arg Asp Arg Asn Lys Tyr Gly I*eu Leu Ala Arg Leu Ala Val 
80 85 90 

5 AG 
144 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS ; not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

15 He Ser Ser His Val His Gly Cys Phe Thr Pro Glu Ser Thr 

15 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: pro.tein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

25 Ser He Ser His Met His Gly Cys Phe Thr Pro Glu Ser Thr 

15 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

35 Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr 

1 5 . 10 

(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 

Val Ala Ala Lys Ser Asp Gly Cys Phe Pro Gly Ser Ala Thr 
15 10 

10 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : hot relevant 
15 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu 
1 5 10 

20 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
25 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr 
1 S , : 10 

30 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
35 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Val Ala Ala Lye Ser Gly Gly Cye Phe Pro Ala Gly Ala Arg Thr 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : not relevant 

(D) TOPOLOGY: both 

10 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

20 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Ala Ala Lys Thr Gly Gly Cys Phe Pro Gly Glu Ala Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

30 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Gly Val Arg Ser Gly Gly Cys Phe Pro Gly Thr Ala Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

35 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOIjBCULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Leu Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

^0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

15 (ii) MOL.ECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Gly His Gly Cys Phe Thr Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

25 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Gly His Gly CyB Phe Thr Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

35 (ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16; 

hye Ser Gly Gly Cys Phe Pro Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO; 17: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Asp Val Arg l»eu His Leu Lys Gin Phe Ala Leu Leu Cys Phe He 
15 10 15 

Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 

15 Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Lys 
50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu He 
20 6 5 7 0 7 5 8 0 

Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Asn 
85 90 95 

Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
100 105 110 

25 Ala He Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

Glu Gly Trp Asp Glu Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys 
30 145 150 155 160 

Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165 170 175 

Tyr Tyr Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

35 Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr val 
195 200 205 

Thr Leu Gly Asp Gly Thr Arg Lys Pro He Lys Asp Leu Lys Val Gly 
210 215 220 

Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu He Ser Asp 
40 225 230 235 240 
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Phe lie Met Phe lie Asp His Asp Pro Thr Thr Arg Arg Gin Phe He 
245 250 255 

Val He Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
260 * 265 270 

5 His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly He Thr Ala 
275 280 285 

Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 

Asp Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg He Tyr Thr 
10 305 310 315 320 

Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 
325 330 335 

He Val Asp Gin Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
340 345 350 

15 Lys Trp Ala His Trp Ala Phe Ala Pro Val Arg Leu Cys His Lys Leu 
355 360 365 

Met Thr Trp Leu Phe Pro Ala Arg Glu Ser Asn Val Asn Phe Gin Glu 

370 375 380 

Asp Gly He His Trp Tyr Ser Asn Met Leu Phe His He Gly Ser Trp 
20 385 390 395 400 

Leu Leu Asp Arg Asp Ser Phe His Pro Leu Gly He Leu His Leu Ser 
405 410 415 

(2) INFORMATION FOR SSQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 41B amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18; 

30 Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
15 10 15 

Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe He 
35 35 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

40 Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 

85 90 95 
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10 



15 



20 



30 



35 



Leu Met Thr 

Val Met Asn 
115 

Asp Glu Asp 
130 

Ala Val Asp 
145 

Leu Ser Arg 

Ser Lys Ala 

Ala Lys Ser 
195 

Asp Gly Gly 
210 

Leu Ala Ala 

225 

Phe Thr Asp 

Thr Gin Glu 

Phe Val Leu 
275 

Tyr Ala Ser 
290 

Ser Gly Gin 
305 

Gin Arg Gly 
Asp Arg lie 

Ala His Leu 

355 

Phe Leu Phe 
370 

Gin Glu Gly 
385 

Trp Leu Leu 
Ser Ser 



Gin Arg 
100 

His Trp 

Gly His 

He Thr 

Leu Ala 
165 

His He 
180 

Gly Gly 

Gin Lys 

Asp Ser 

Arg Asp 
245 

Pro Val 
260 

Asp Asn 
Ser Val 
Leu Lys 

Ser Phe 

325 

Leu Ala 
340 

Ala Phe 
Pro Gin 
Val His 



Asp Ser 
405 



Cys Lys Asp 

Pro Gly Val 
120 

His Phe Glu 
135 

Thr Ser Asp 
150 

Val Glu Ala 
His Cys Ser 

Cys Phe Pro 

200 

Ala Val Lys 
215 

Ala Gly Asn 

230 

Ser Thr Thr 

Glu Lys lie 

Ser Thr Glu 
280 

Arg Ala Gly 
295 

Ser Val He 
310 

Ala Pro Val 
Ser Cys Tyr 



Ala Pro Ala 

360 

Asn Ser Ser 
375 

Trp Tyr Ser 
390 

Asn Met Leu 



Lys Leu 
105 

Lys Leu 
Glu Ser 
Arg Asp 
Gly Phe 

170 

Val Lys 
185 

Gly Ser 

Asp Leu 

Leu Val 

Arg Arg 
250 

Thr Leu 
265 

Asp Leu 

Gin Lys 

Val Gin 

Thr Ala 
330 

Ala val 
345 

Arg Leu 
Ser Arg 
Arg Leu 



His Pro 
410 



Asn Ser 

Arg Val 

Leu His 
140 

Lys Ser 
155 

Asp Trp 

Ala Glu 

Ala Leu 

Asn Pro 
220 

Phe Ser 

235 

Val Phe 

Thr Ala 

His Thr 

Val Met 
300 

Arg He 
315 

His Gly 
He Glu 
Tyr Tyr 



Ser Asn 
380 

Leu Tyr 
395 



Leu Ala He Ser 
110 

Thr Glu Gly Trp 
125 

Tyr Glu Gly Arg 



Lys Tyr Gly Thr 
160 

Val Tyr Tyr Glu 
175 

Asn Ser Val Ala 
190 

Val Ser Leu Gin 
205 

Gly Asp Lys Val 



Asp Phe He Met 
240 

Tyr Val He Glu 
255 

Ala His Leu Leu 
270 

Met Thr Ala Ala 
285 

Val Val Asp Asp 



Tyr Thr Glu Glu 
320 

Thr He Val Val 
335 

Asp Gin Gly Leu 
350 

Tyr Val Ser Ser 

365 

Ala Thr Leu Gin 



Gin Met Gly Thr 
400 



Leu Gly Met Ser Val Asn 
415 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Val Glu Met Leu Leu Leu Thr Arg lie Leu Leu Val Gly Phe He 
10 1 5 10 « 

Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

He Gly His Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

15 Gin Phe He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 

50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu He 
65 70 75 80 

Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly 
20 8 5 90 95 

Ala Asp Arg Leu Met Thr Cys Arg Cys Lys Asp Lys Leu Asn Ala Leu 
*^ 100 105 HO 

Ala He Ser Val Met Asn Cys Trp Pro Gly Val Met Leu Arg Val Thr 
115 120 125 

25 Glu Gly Trp Asp Glu Asp Gly His. His Ser Lys Glu Ser Leu His Tyr 

130 135 "0 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 1^0 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
30 165 170 175 

Tyr Tyr Glu Ser Lys Ala His He Cys Ser Val Lys Ala Glu Asn Ser 
180 185 190 

val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His 
195 200 ' 205 

35 Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser His Gly Asp 

210 215 220 

Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Val Ser Asp Phe 
225 230 235 240 

Leu Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
40 245 250 255 
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Val He Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

5 Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 

290 295 300 

Pro Val Val Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
10 325 330 335 

Thr Thr Ala Cys Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys 
340 345 350 

Tyr Ala Val He Glu Glu His Ser Trp Ala His Ala Ala Phe Ala Pro 
355 360 365 

15 His Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 

370 375 380 

He Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
385 390 395 400 

Leu Leu Tyr Arg He Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
20 405 410 415 

Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

30 Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val He Leu Ala Ser Ser 
15 10 15 

Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 35 .40 45 

He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

40 Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 

85 90 95 
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10 



15 



20 



25 



30 



35 



Arg Ijeu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie 
100 105 110 

Ser Val Met Asn Gin Trp Pro Oly Val Lys Leu Arg Val Thr Glu Oly 
115 120 125 

Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 • 

Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val He 
245 250 255 

Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 

Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 

325 330 335 

His Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

He Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 

355 360 365 

Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

Gly Gly Gly Ser He Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 
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Ala Glu Pro Thr Ala Gly He His Trp Tyr Ser Gin Leu Leu Tyr His 
405 410 415 

He Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 



Ala Val Lys Ser Ser 
435 
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CLAIMS 

1 A substantiaUy pure polypeptide characterized by having an amino acid sequence 
derived from amino terminal amino adds of a hedgehog protein and having at its 
carboxy terminus, a GICF cleavage site spedficaUy recognized by a proteolytic 
activity of tiie carboxy teiminal fragment of the native hedgehog polypeptide. 

2. An isolated polynucleotide sequence encoding a polypeptide having an amino 
acid sequence of claim 1. 

3 . The isolated polynucleotide sequence of claim 2. encoding a polypeptide having 
an amino acid sequence of ti»e polypeptide of claim 1 and having at least one 
epitope for an antibody iramunoreactive with tiie polypeptide of claim 1 

4. A recombinam expression vector which contains tiie polynucleotide of claim 2. 

5. A host cell whidj contains the expression vwtor of daim 4. 

6. An antibody which binds to the polypeptide of claim 1 and which binds witii 
immunoreactive fragments of the polypeptide of claim 1 . 

7. The antibody of claim 6, wherein the antibody.is polyclonal. 

8. The antibody of daim 6, wherein the antibody is monoclonal. 
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9. A substantially pure polypeptide characterized by having an amino acid sequence 
derived from carboxy terminal amino acids of a hedgehog protein and having at 
its amino terminus, a Gl CF cleavage site specifically recognized by a proteolytic 
activity of the carboxy terminal fragment of the native hedgehog polypeptide. 

10. An isolated polynucleotide sequence encoding a polypeptide having an amino 
acid sequence of claim 9. 

1 1 . The isolated polynucleotide sequence of claim 10, encoding a polypeptide having 
an amino acid sequence of the polypeptide of claim 9 and having at least one 
epitope for an antibody immunoreactive with the polypeptide of claim 9. 

12. A recombinant expres»on vector which contains the polynucleotide of claim 1 0. 

13. A host cell which contains the expression vector of claim 12. 

14. An antibody which binds to the polypeptide of claim 9 and which binds with 
immunoreactive fragments of the polypeptide of claim 9. 

15. The antibody of claim 14, wherein the antibody is polyclonal. 

16. The antibody of claim 14, wherein the antibody is monoclonal. 

17. A method for modulating proliferation or differentiation of neuronal cells, 
comprising contacting the cells with a hedgehog polypeptide. 

18. The method of claim 1 7, wherein the hedgehog polypeptide is the polypeptide of 
claim 1. 
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19. The method of claim 17, wherein the hedgehog polypeptide is the polypeptide of 
daim9. 

20. The method of claim 17, wherein the modulation is induction of prolifiwation or 
differentiation. 

21 . The method of claim 17, wherein the neuronal ceUs are substantially derived from 
floor plate neuronal cells. 

22. The method of claim 2 1 , wherein the neuronal cells are motor neurons. 

23. The method of claim 17, wherein the neuronal cells are vertebrate neuronal cells. 

24. The method of claim 23 , wherdn the vertebrate is a human. 

25. The method of claim 17. fiirther comprising contacting the cell with a member of 
the TGF-p family. 

26. The method of claim 25, wherein the TGF-p is activin. 

27. The method of claim 17, further comprising contacting the cell with a neural 
inducing agent. 

28. The method of claim 27, wherein the neural inducing agent is selected from the 
group consisting of noggin, foUistatin. and a neurotrophin. 

29. An autoproteolytic fusion protein comprising a first polypeptide including the 
proteolytic domain of the polypeptide of claim 9, a cleavage site recognized by 
the first polypeptide, and a second polypeptide. 
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30. A polynucleotide encoding the fusion protein of claim 29. 

31. A method for producing an autoproteolytic fusion protein comprising operably 
linking a first polynucleotide oicoding a first polypeptide including the 
proteolytic domain of the polypeptide of claim 9 and the cleavage site recognized 
by the proteolytic domain, and a second polynucleotide encoding a second 
polypeptide. 
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FIGURE IE 
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FIGURE 4 
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